隐藏在克隆人中:通过打乱对隐私放大进行简单和接近最佳的分析 (Hiding Among the Clones: A Simple and Nearly Optimal Analysis of Privacy Amplification by Shuffling)

Recent work of Erlingsson, Feldman, Mironov, Raghunathan, Talwar, and Thakurta [EFMRTT19] demonstrates that random shuffling amplifies differential privacy guarantees of locally randomized data. Such amplification implies substantially stronger privacy guarantees for systems in which data is contributed anonymously [BEMMRLRKTS17] and has lead to significant interest in the shuffle model of privacy [CSUZZ19; EFMRTT19]. We show that random shuffling of $n$ data records that are input to $\varepsilon_0$-differentially private local randomizers results in an $(O((1-e^{-\varepsilon_0})\sqrt{\frac{e^{\varepsilon_0}\log(1/\delta)}{n}}), \delta)$-differentially private algorithm. This significantly improves over previous work and achieves the asymptotically optimal dependence in $\varepsilon_0$. Our result is based on a new approach that is simpler than previous work and extends to approximate differential privacy with nearly the same guarantees. Importantly, our work also yields an algorithm for deriving tighter bounds on the resulting $\varepsilon$ and $\delta$ as well as R\'enyi differential privacy guarantees. We show numerically that our algorithm gets to within a small constant factor of the optimal bound. As a direct corollary of our analysis we derive a simple and nearly optimal algorithm for frequency estimation in the shuffle model of privacy. We also observe that our result implies the first asymptotically optimal privacy analysis of noisy stochastic gradient descent that applies to sampling without replacement.

翻译：Erlingsson、Feldman、Mironov、Raghunathan、Talwar和Thakurta最近的工作表明,随机打拼会扩大本地随机数据的不同隐私保障。这种打拼意味着对匿名提供数据的系统大大加强隐私保障[BEMMRLRKTS17],并导致对保密的打拼模式[CSUZZ19;EFMRT19]的极大兴趣。我们显示,随机打乱美元的数据记录可以输入$varepsilon_0美元差异化的本地私人随机数据记录,导致近乎(O(1-e ⁇ -\ varepslon_0})的频率隐私保障。我们的结果是以新办法(O(1-e- e-\\\ varepslon_0)] 的频率进行不同的隐私保障,也意味着我们之前的正常的汇率分析。