Differential Privacy (DP) provides an elegant mathematical framework for defining a provable disclosure risk in the presence of arbitrary adversaries; it guarantees that whether an individual is in a database or not, the results of a DP procedure should be similar in terms of their probability distribution. While DP mechanisms are provably effective in protecting privacy, they often negatively impact the utility of the query responses, statistics and/or analyses that come as outputs from these mechanisms. To address this problem, we use ideas from the area of robust statistics which aims at reducing the influence of outlying observations on statistical inference. Based on the preliminary known links between differential privacy and robust statistics, we modify the objective perturbation mechanism by making use of a new bounded function and define a bounded M-Estimator with adequate statistical properties. The resulting privacy mechanism, named "Perturbed M-Estimation", shows important potential in terms of improved statistical utility of its outputs as suggested by some preliminary results. These results consequently support the need to further investigate the use of robust statistical tools for differential privacy.
翻译:差异隐私(DP)提供了一个优雅的数学框架,用于在任意对手在场的情况下界定可证实的披露风险;它保证个人是否在数据库中,DP程序的结果在概率分布方面应当相似;虽然DP机制在保护隐私方面效果良好,但往往对作为这些机制产出的查询答复、统计和(或)分析的效用产生消极影响;为解决这一问题,我们利用来自可靠统计领域的想法,目的是减少对统计推理所依赖的观察的影响;根据差异隐私与稳健统计之间的初步已知联系,我们通过使用新的约束功能来修改客观干扰机制,并定义具有适当统计属性的封闭M-模拟器,由此产生的称为“边缘M-估计”的隐私机制在改进其产出的统计效用方面具有重要潜力,正如一些初步结果所建议的那样。因此,这些结果证明有必要进一步调查使用稳健的统计工具促进差异隐私的必要性。