Most published work on differential privacy (DP) focuses exclusively on meeting privacy constraints, by adding to the query noise with a pre-specified parametric distribution model, typically with one or two degrees of freedom. The accuracy of the response and its utility to the intended use are frequently overlooked. Considering that several database queries are categorical in nature (e.g., a label, a ranking, etc.), or can be quantized, the parameters that define the randomized mechanism's distribution are finite. Thus, it is reasonable to search through numerical optimization for the probability masses that meet the privacy constraints while minimizing the query distortion. Considering the modulo summation of random noise as the DP mechanism, the goal of this paper is to introduce a tractable framework to design the optimum noise probability mass function (PMF) for database queries with a discrete and finite set, optimizing with an expected distortion metric for a given $(\epsilon,\delta)$. We first show that the optimum PMF can be obtained by solving a mixed integer linear program (MILP). Then, we derive closed-form solutions for the optimum PMF that minimize the probability of error for two special cases. We show numerically that the proposed optimal mechanisms significantly outperform the state-of-the-art.
翻译:有关不同隐私(DP)的已出版工作大多完全侧重于满足隐私限制,办法是在查询噪音中增加预先指定的参数分布模型,通常有1至2度的自由度。答复的准确性及其对预定用途的效用经常被忽视。考虑到若干数据库查询具有绝对性(例如标签、排名等),或可以量化,界定随机化机制分布的参数是有限的。因此,合理的做法是通过数字优化搜索满足隐私限制的概率质量,同时尽量减少查询扭曲。考虑到随机噪音作为DP机制的模调和,本文件的目标是引入一个可移植的框架,设计最佳噪音概率质量功能,用于独立和有限的数据库查询,以预期的扭曲度为给定的(cepslon,\delta)值优化。我们首先表明,通过解决混合整数线性程序(MILP)可以取得最佳的PMF。然后,我们为最佳PMF提出封闭式解决方案,以尽量减少两种特殊案例的最佳误差率。我们展示了两种特殊案例的拟议数字机制。我们展示了最优性的数字机制。