PAC 安全:自动隐私测量和数据处理控制 (PAC Security: Automatic Privacy Measurement and Control of Data Processing)

We propose and study a new privacy definition, termed Probably Approximately Correct (PAC) Security. PAC security characterizes the information-theoretic hardness to recover sensitive data given arbitrary information disclosure/leakage during/after any processing. Unlike the classic cryptographic definition and Differential Privacy (DP), which consider the adversarial (input-independent) worst case, PAC security is a simulatable metric that quantifies the instance-based impossibility of inference. A fully automatic analysis and proof generation framework is proposed: security parameters can be produced with arbitrarily high confidence via Monte-Carlo simulation for any black-box data processing oracle. This appealing automation property enables analysis of complicated data processing, where the worst-case proof in the classic privacy regime could be loose or even intractable. Moreover, we show that the produced PAC security guarantees enjoy simple composition bounds and the automatic analysis framework can be implemented in an online fashion to analyze the composite PAC security loss even under correlated randomness. On the utility side, the magnitude of (necessary) perturbation required in PAC security is not lower bounded by $\Theta(\sqrt{d})$ for a $d$-dimensional release but could be O(1) for many practical data processing tasks, which is in contrast to the input-independent worst-case information-theoretic lower bound. Example applications of PAC security are included with comparisons to existing works.

翻译：我们提议并研究一个新的隐私定义,称为“可能最正确(PAC)安全 ” 。 PAC安全是信息理论硬性,以在任何处理过程中/处理后任意披露/泄漏信息时任意披露/泄漏信息,恢复敏感数据。与传统的加密定义和差异隐私(DP)不同,后者认为对抗(独立)最差的对立(独立)最差情况,PAC安全是一种可模拟的衡量标准,它量化了无法以实例为基础的推推推推的可能性。提出了完全自动的分析和证据生成框架:安全参数可以通过蒙特-卡罗模拟为任何黑盒数据处理或触黑箱数据处理或触雷器以任意的高度信心生成。这种上诉自动化属性使得能够分析复杂的数据处理,而传统的隐私制度最坏的证明可能是松散甚至棘手的。此外,我们表明,产生的PAC安全(独立) 安全保证享有简单的构成约束,并且自动分析框架可以在线实施,以分析以实例为基础的无法测算,即使是在相关随机随机分析的情况下,综合PAC安全损失的综合PAC安全损失。在效用方面,PAC安全需要(必要的)现有(最深)的(必要的)过深层安全程度,但PAC安全需要的(但最小的(最小的)的(最小的)范围范围范围不是受美元的约束,但由O(美元/西的)应用应用应用应用应用的A-最小的A-最低的A-最低的、最小的、最小的、最小的、最小的数据处理任务中,包括投入的A-最小的、最小的、最小的、最小的、最小的、最小的、最小的、最小的、最小的、最小的、最小的、最的、最小的、最小的、最小的、最小的、最的处理的处理的、最的、最的、最的、最的、最的、最的、最的、最的、最的、最的、最的、最的、最的、最的、最的、最的、最的、最的、最的、最的、最的、最的、最的、最的、最的、最的、最的、最的、最的、最的、最的、最的、最的、最