Counting the number of distinct elements distributed over multiple data holders is a fundamental problem with many real-world applications ranging from crowd counting to network monitoring. Although a number of space and computational efficient sketch methods (e.g., the Flajolet-Martin sketch and the HyperLogLog sketch) for cardinality estimation have been proposed to solve the above problem, these sketch methods are insecure when considering privacy concerns related to the use of each data holder's personal dataset. Despite a recently proposed protocol that successfully implements the well-known Flajolet-Martin (FM) sketch on a secret-sharing based multiparty computation (MPC) framework for solving the problem of private distributed cardinality estimation (PDCE), we observe that this MPC-FM protocol is not differentially private. In addition, the MPC-FM protocol is computationally expensive, which limits its applications to data holders with limited computation resources. To address the above issues, in this paper we propose a novel protocol DP-DICE, which is computationally efficient and differentially private for solving the problem of PDCE. Experimental results show that our DP-DICE achieves orders of magnitude speedup and reduces the estimation error by several times in comparison with state-of-the-arts under the same security requirements.
翻译:计算在多个数据持有者之间分布的不同要素的数量是一个根本性问题,许多现实世界应用都存在的问题,从人群计数到网络监测等,尽管为解决上述问题,提出了许多用于基本估计的空间和计算高效的草图方法(如Flajolet-Martin草图和HyperLogLog草图),但考虑到与使用每个数据持有者个人数据集有关的隐私问题,这些草图方法没有保障。尽管最近提出的一项议定书成功地执行了众所周知的Flajolet-Martin(FM)关于基于多方计算(MPC)的秘密共享框架草图,以解决私人分布的基点估计(PDCE)问题,但我们认为,这一MPC-MM协议并非有差别的私有性协议。此外,MPC-MM协议计算费用昂贵,将其应用限于有限的计算资源有限的数据持有者。为了解决上述问题,我们在本文件中提出了一个新的协议DP-DICE(FM),这是计算高效和差别的私人协议,以解决PDCE问题。 实验结果显示,我们的DP-DICE(PDCE)在比照度方面实现了安全速度要求,并减少国家错误。