In this work, we give efficient algorithms for privately estimating a Gaussian distribution in both pure and approximate differential privacy (DP) models with optimal dependence on the dimension in the sample complexity. In the pure DP setting, we give an efficient algorithm that estimates an unknown $d$-dimensional Gaussian distribution up to an arbitrary tiny total variation error using $\widetilde{O}(d^2 \log \kappa)$ samples while tolerating a constant fraction of adversarial outliers. Here, $\kappa$ is the condition number of the target covariance matrix. The sample bound matches best non-private estimators in the dependence on the dimension (up to a polylogarithmic factor). We prove a new lower bound on differentially private covariance estimation to show that the dependence on the condition number $\kappa$ in the above sample bound is also tight. Prior to our work, only identifiability results (yielding inefficient super-polynomial time algorithms) were known for the problem. In the approximate DP setting, we give an efficient algorithm to estimate an unknown Gaussian distribution up to an arbitrarily tiny total variation error using $\widetilde{O}(d^2)$ samples while tolerating a constant fraction of adversarial outliers. Prior to our work, all efficient approximate DP algorithms incurred a super-quadratic sample cost or were not outlier-robust. For the special case of mean estimation, our algorithm achieves the optimal sample complexity of $\widetilde O(d)$, improving on a $\widetilde O(d^{1.5})$ bound from prior work. Our pure DP algorithm relies on a recursive private preconditioning subroutine that utilizes the recent work on private mean estimation [Hopkins et al., 2022]. Our approximate DP algorithms are based on a substantial upgrade of the method of stabilizing convex relaxations introduced in [Kothari et al., 2022].
翻译:在这项工作中,我们给出了高效的算法, 用于私下估算纯度和近似差分隐私(DP)模型中的高斯分布, 其最佳依赖度取决于抽样复杂度。 在纯 DP 设置中, 我们给出了一个高效的算法, 估计一个未知的美元维度高斯分布, 直至一个任意的微小总变差错误, 使用 $\ 广度{O} (d2\log\kappa) (d2\log\kappa) 样本, 以维持一个固定的对立差分数。 这里, $( kaptappa) 是目标变量变量变异矩阵的数。 样本结合了最佳非专用的 非专用估测算器 。 我们用一个高效的奥氏平面调值运算法, 之前用一个未知的奥氏变数计算法 。 之前, 显示对条件值的 $ kaptappa a test real droal drial drial oration a preal drial develop raltial rial rial rialtial ormax) a preal drealtistration.