Content moderation (removing or limiting the distribution of posts based on their contents) is one tool social networks use to fight problems such as harassment and disinformation. Manually screening all content is usually impractical given the scale of social media data, and the need for nuanced human interpretations makes fully automated approaches infeasible. We consider content moderation from the perspective of technology-assisted review (TAR): a human-in-the-loop active learning approach developed for high recall retrieval problems in civil litigation and other fields. We show how TAR workflows, and a TAR cost model, can be adapted to the content moderation problem. We then demonstrate on two publicly available content moderation data sets that a TAR workflow can reduce moderation costs by 20% to 55% across a variety of conditions.
翻译:内容节制(根据内容改变或限制职位分配)是用来对付骚扰和虚假信息等问题的一个社会网络工具。鉴于社交媒体数据的规模,人工筛选所有内容通常不切实际,而需要细化的人的诠释使得完全自动化的方法不可行。我们认为,从技术辅助审查(TAR)的角度看,内容节制:为民事诉讼和其他领域的高回溯检索问题而开发的“人与人间积极学习”办法。我们展示了TAR工作流程和TAR成本模式如何适应内容节制问题。我们随后在两种公开可得到的内容节制数据集上展示,TAR工作流程可以在不同条件下将适度成本降低20%至55%。