With advancements in technology, the threats to the privacy of sensitive data (e.g. location data) are surging. A standard method to mitigate the privacy risks for location data is by adding noise to the true values to achieve geo-indistinguishability. However, we argue that geo-indistinguishability alone is insufficient to cover all privacy concerns. In particular, isolated locations are not protected by the state-of-the-art Laplace mechanism (LAP) for geo-indistinguishability. We focus on a mechanism that is generated by the Blahut-Arimoto algorithm (BA) from rate-distortion theory. We show that BA, in addition to providing geo-indistinguishability, enforces an elastic metric that ameliorates the issue of isolation. We then study the utility of BA in terms of the statistical precision that can be derived from the reported data, focusing on the inference of the original distribution. To this purpose, we apply the iterative Bayesian update (IBU), an instance of the famous expectation-maximization method from statistics, that produces the most likely distribution for any obfuscation mechanism. We show that BA harbours a better statistical utility than LAP for high privacy and becomes comparable as privacy decreases. Remarkably, we point out that BA and IBU, two seemingly unrelated methods that were developed for completely different purposes, are dual to each other. Exploiting this duality and the privacy-preserving properties of BA, we propose an iterative method, PRIVIC, for a privacy-friendly incremental collection of location data from users by service providers. In addition to extending the privacy guarantees of geo-indistinguishability and retaining a better statistical utility than LAP, PRIVIC also provides an optimal trade-off between information leakage and quality of service. We illustrate the soundness and functionality of our method both analytically and with experiments.
翻译:随着技术的进步,对敏感数据的隐私(如定位数据)的威胁正在急剧增加。降低定位数据隐私风险的标准方法是在真实值上添加噪音,以实现地理不易分化。然而,我们争辩说,仅地理分化不足以涵盖所有隐私问题。特别是,偏僻地点没有受到最新数据拉贝机制的保护,无法进行地理分化。我们侧重于由Blahut-Arimoto 算法(BABA)产生的机制,降低定位数据的隐私风险。我们显示,除了提供地理不易分解点之外,BA还采用了真实值的噪音值,除了提供地理不易分解点外,还采用了一种弹性数据机制。我们从报告的数据中可以得出BABE的精确度,我们用原始分布的推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推推, 。