There are a number of forums where people participate under pseudonyms. One example is peer review, where the identity of reviewers for any paper is confidential. When participating in these forums, people frequently engage in "batching": executing multiple related tasks (e.g., commenting on multiple papers) at nearly the same time. Our empirical analysis shows that batching is common in two applications we consider $\unicode{x2013}$ peer review and Wikipedia edits. In this paper, we identify and address the risk of deanonymization arising from linking batched tasks. To protect against linkage attacks, we take the approach of adding delay to the posting time of batched tasks. We first show that under some natural assumptions, no delay mechanism can provide a meaningful differential privacy guarantee. We therefore propose a "one-sided" formulation of differential privacy for protecting against linkage attacks. We design a mechanism that adds zero-inflated uniform delay to events and show it can preserve privacy. We prove that this noise distribution is in fact optimal in minimizing expected delay among mechanisms adding independent noise to each event, thereby establishing the Pareto frontier of the trade-off between the expected delay for batched and unbatched events. Finally, we conduct a series of experiments on Wikipedia and Bitcoin data that corroborate the practical utility of our algorithm in obfuscating batching without introducing onerous delay to a system.
翻译:有一些人们以化名参与的论坛。 一个例子就是同侪审查, 任何文件的审查者的身份都是保密的。 当参加这些论坛时, 人们经常同时“ 纠缠 ” : 几乎同时执行多重相关任务( 例如对多个文件发表评论 ) 。 我们的经验分析显示, 分批处理在两种应用程序中很常见, 我们考虑的是$unicode{x2013} 美元同侪审查 和维基百科编辑。 在本文件中, 我们确定并处理因连接成批的任务而产生的去匿名化风险。 为了防止联系攻击, 我们采取了在分批任务张贴时间上增加延迟的方法。 我们首先显示, 在一些自然假设下, 没有延迟机制能够提供有意义的差异隐私保障。 因此, 我们提议“ 片面” 配制差异隐私, 来保护联系攻击。 我们设计一个机制, 给事件增加零加固的统一延误, 并显示它能够保护隐私。 我们证明, 这种噪音的传播实际上最理想的办法是最大限度地减少每件中增加独立噪音的机制之间的拖延, 从而在分批任务时,, 从而确定在预期的比级标准级标准级标准上, 我们最终开始进行实际的延迟的升级, 。