Differential privacy (DP) defines privacy protection by promising quantified indistinguishability between individuals that consent to share their privacy-sensitive information and the ones that do not. DP aims to deliver this promise by including well-crafted elements of random noise in the published data and thus there is an inherent trade-off between the degree of privacy protection and the ability to utilize the protected data. Currently, several open-source tools were proposed for DP provision. To the best of our knowledge, there is no comprehensive study for comparing these open-source tools with respect to their ability to balance DP's inherent trade-off as well as the use of system resources. This work proposes an open-source evaluation framework for privacy protection solutions and offers evaluation for OpenDP Smartnoise, Google DP, PyTorch Opacus, Tensorflow Privacy, and Diffprivlib. In addition to studying their ability to balance the above trade-off, we consider discrete and continuous attributes by quantifying their performance under different data sizes. Our results reveal several patterns that developers should have in mind when selecting tools under different application needs and criteria. This evaluation survey can be the basis for an improved selection of open-source DP tools and quicker adaptation of DP.
翻译:不同隐私(DP)定义了隐私保护,承诺同意分享隐私敏感信息的个人与同意分享隐私敏感信息的个人之间有量化的不可区分性。DP的目标是实现这一承诺,在公布的数据中纳入精心设计的随机噪音要素,从而在隐私保护程度和利用受保护数据的能力之间有着内在的权衡。目前,为DP的提供提议了若干开放源码工具。根据我们的知识,没有进行全面研究,比较这些开放源码工具在平衡DP内在的取舍以及系统资源的使用方面的能力。这项工作提出了隐私保护解决方案的开放源码评价框架,并为OpenDP Smartnoise、Google DP、PyTorrch Opacus、Tensorflow Preial和Diffprivlib提供了评价。除了研究它们平衡上述交易的能力外,我们还考虑通过量化不同数据尺寸的绩效来保持离散和连续的属性。我们的研究结果显示,开发者在根据不同应用需要和标准选择工具时,应当想到几种模式。本次评价调查可以作为更快速选择开放源工具的基础。