Extended differential privacy, a generalization of standard differential privacy (DP) using a general metric, has been widely studied to provide rigorous privacy guarantees while keeping high utility. However, existing works on extended DP are limited to few metrics, such as the Euclidean metric. Consequently, they have only a small number of applications, such as location-based services and document processing. In this paper, we propose a couple of mechanisms providing extended DP with a different metric: angular distance (or cosine distance). Our mechanisms are based on locality sensitive hashing (LSH), which can be applied to the angular distance and work well for personal data in a high-dimensional space. We theoretically analyze the privacy properties of our mechanisms, and prove extended DP for input data by taking into account that LSH preserves the original metric only approximately. We apply our mechanisms to friend matching based on high-dimensional personal data with angular distance in the local model, and evaluate our mechanisms using two real datasets. We show that LDP requires a very large privacy budget and that RAPPOR does not work in this application. Then we show that our mechanisms enable friend matching with high utility and rigorous privacy guarantees based on extended DP.
翻译:扩大的隐私,即使用通用指标的标准化差分隐私(DP)的普及,已经进行了广泛的研究,以提供严格的隐私保障,同时保持高效用;然而,扩展的DP的现有工作仅限于少数量度,例如Euclidean 度量,因此,它们只有少量的应用,例如基于地点的服务和文件处理。在本文件中,我们提议了几个机制,以不同的量度提供扩展的DP:角距离(或连线距离)。我们的机制以对地敏感散列(LSH)为基础,可适用于角距离,在高维空间的个人数据方面运作良好。我们从理论上分析我们机制的隐私性质,并通过考虑到LSH仅大致保留原始度量量量来证明输入数据的扩展DP。我们运用我们的机制,以基于高维度个人数据与本地模型的角距离进行匹配,并使用两个真实的数据集来评估我们的机制。我们显示,LDP需要非常庞大的隐私预算,而RAPPOR在这种应用中不起作用。然后我们表明,我们的机制能够使朋友与高效用和严格的隐私保障与高效用和基于扩展的DP相匹配。