In this paper, we assess existing technical proposals for content moderation in End-to-End Encryption (E2EE) services. First, we explain the various tools in the content moderation toolbox, how they are used, and the different phases of the moderation cycle, including detection of unwanted content. We then lay out a definition of encryption and E2EE, which includes privacy and security guarantees for end-users, before assessing current technical proposals for the detection of unwanted content in E2EE services against those guarantees. We find that technical approaches for user-reporting and meta-data analysis are the most likely to preserve privacy and security guarantees for end-users. Both provide effective tools that can detect significant amounts of different types of problematic content on E2EE services, including abusive and harassing messages, spam, mis- and disinformation, and CSAM, although more research is required to improve these tools and better measure their effectiveness. Conversely, we find that other techniques that purport to facilitate content detection in E2EE systems have the effect of undermining key security guarantees of E2EE systems.
翻译:在本文中,我们评估了现有关于端到端加密(E2EE)服务内容调适的技术建议。首先,我们解释了内容调适工具箱中的各种工具,如何使用这些工具,以及调适周期的不同阶段,包括发现不想要的内容。然后,我们提出了加密和E2EE的定义,其中包括最终用户的隐私和安全保障,然后评估了目前针对E2EE服务中发现不想要的内容的技术建议,并对这些保障进行了评估。我们发现,用户报告和元数据分析的技术方法最有可能保护最终用户的隐私和安全保障。两者都提供了有效工具,能够发现E2E服务中存在大量不同种类的问题内容,包括虐待和骚扰性信息、垃圾邮件、错误和不实信息以及计算机安全管理,尽管还需要进行更多的研究,以改进这些工具并更好地衡量其有效性。相反,我们发现,旨在便利E2E系统内容检测的其他技术会破坏E2E系统的关键安全保障。