Differential privacy (DP) allows data analysts to query databases that contain users' sensitive information while providing a quantifiable privacy guarantee to users. Recent interactive DP systems such as APEx provide accuracy guarantees over the query responses, but fail to support a large number of queries with a limited total privacy budget, as they process incoming queries independently from past queries. We present an interactive, accuracy-aware DP query engine, CacheDP, which utilizes a differentially private cache of past responses, to answer the current workload at a lower privacy budget, while meeting strict accuracy guarantees. We integrate complex DP mechanisms with our structured cache, through novel cache-aware DP cost optimization. Our thorough evaluation illustrates that CacheDP can accurately answer various workload sequences, while lowering the privacy loss as compared to related work.
翻译:不同隐私(DP)允许数据分析员查询含有用户敏感信息的数据库,同时向用户提供可量化的隐私保障。最近,APEx等互动式DP系统为查询答复提供了准确性保障,但未能以有限的全面隐私预算支持大量查询,因为这些查询与过去查询无关。 我们提出了一个互动的、准确性的DP查询引擎,即CacheDP,它利用对过去答复的不同私人缓存,以较低隐私预算应对当前的工作量,同时满足严格的准确性保障。我们通过新的缓冲识别DP成本优化,将复杂的DP机制与结构化缓存结合起来。 我们的彻底评估表明,CacheDP可以准确回答各种工作量序列,同时降低隐私损失与相关工作相比。