Different from existing Differential Privacy (DP) accountants, we introduce pro-active DP. Existing DP accountants keep track of how privacy budget has been spent while pro-active DP is a scheme that allows one to {\it a-priori} select parameters of DP-SGD based on a fixed privacy budget (in terms of $\epsilon$ and $\delta$) in such a way to optimize the anticipated utility (test accuracy) the most. To implement this idea, we show how to convert the classical DP moment accountant to a pro-active DP by exploiting the fact that it has a simple close form for computing spent privacy budget for a given interaction round. The DP moment accountant is introduced in context of DP-SGD and has the following property which is the key ingredient to build pro-active DP. In DP-SGD each round communicates a local SGD update which leaks some new information about the underlying local data set to the outside world. In order to provide privacy, Gaussian noise with standard deviation $\sigma$ is added to local SGD updates after performing a clipping operation and normalizing with the clipping constant. We show that for attaining $(\epsilon,\delta)$-differential privacy $\sigma$ can be chosen equal to $\sqrt{2(\epsilon +\ln(1/\delta))/\epsilon}$ for $\epsilon=\Omega(T/N^2)$, where $T$ is the total number of rounds and $N$ is equal to the size of the local data set. In many existing machine learning problems, $N$ is always large and $T=O(N)$. Hence, $\sigma$ becomes ``independent'' of any $T=O(N)$ choice with $\epsilon=\Omega(1/N)$. This means that our {\em $\sigma$ only depends on $N$ rather than $T$}. We show how this differential privacy characterization allows us to convert DP moment accountant to a pro-active DP.
翻译:与现有的差异隐私(DP)会计不同, 我们引入了前瞻性的DP。 现有的DP会计师会跟踪隐私预算的花费情况, 而预防性的DP是一个计划, 允许根据固定的隐私预算( $\ epsilon$ 和 $\ delta$) 选择 DP- SGD 的选定参数( 以固定的隐私预算( $\ epsilon$ 和 $\ delta$ ) 优化预期的私隐( 测试准确性) 。 为了实施这个理念, 我们展示了如何将传统DP时刻的音量转换成一个主动的DP。 利用它有一个简单接近的形式, 用来计算一个特定互动回合的私隐预算( $ $ ) 。 在DP- 美元 的剪裁剪裁操作中引入了DP- $ 美元 美元 。 我们显示的是 美元 = 美元 美元 = Qrqm 的本地数据 。