Recent years have seen a strong uptick in both the prevalence and real-world consequences of false information spread through online platforms. At the same time, encrypted messaging systems such as WhatsApp, Signal, and Telegram, are rapidly gaining popularity as users seek increased privacy in their digital lives. The challenge we address is how to combat the viral spread of misinformation without compromising privacy. Our FACTS system tracks user complaints on messages obliviously, only revealing the message's contents and originator once sufficiently many complaints have been lodged. Our system is private, meaning it does not reveal anything about the senders or contents of messages which have received few or no complaints; secure, meaning there is no way for a malicious user to evade the system or gain an outsized impact over the complaint system; and scalable, as we demonstrate excellent practical efficiency for up to millions of complaints per day. Our main technical contribution is a new collaborative counting Bloom filter, a simple construction with difficult probabilistic analysis, which may have independent interest as a privacy-preserving randomized count sketch data structure. Compared to prior work on message flagging and tracing in end-to-end encrypted messaging, our novel contribution is the addition of a high threshold of multiple complaints that are needed before a message is audited or flagged. We present and carefully analyze the probabilistic performance of our data structure, provide a precise security definition and proof, and then measure the accuracy and scalability of our scheme via experimentation.
翻译:近些年来,通过在线平台传播虚假信息的普遍程度和真实世界后果都出现了强烈反响。 与此同时,当用户在数字生活中寻求增加隐私时,诸如“WhesApp”、“Signal”和“Telegram”等加密信息系统正在迅速受到欢迎。我们处理的挑战是如何在不损害隐私的情况下打击错误信息的病毒扩散。我们的FATS系统在信息上盲目地追踪用户对信息的投诉,仅仅在提出足够多的投诉后披露信息的内容和发端人。我们的系统是私人的,意思是它没有透露任何关于很少收到或没有投诉的信息的发送者或内容;安全,这意味着恶意用户无法逃避系统或对数字生活中的超大影响;我们处理的挑战是如何在不损害隐私的情况下打击错误信息传播的病毒传播。我们的主要技术贡献是一个新的协作性计算布鲁姆过滤器,这是一个简单的结构,它可能具有独立的兴趣,因为它是保密随机统计的草图数据结构。 与以前关于信息标识和追踪的信息的标记和追踪工作相比,在终端到终端的信息传递之前的准确性定义之前,我们所需要的一个最新数据分析标准是我们目前需要的精确的、精确的系统。