Local differential privacy (LDP) is a differential privacy (DP) paradigm in which individuals transmit their data to a curator after applying a DP mechanism to it (often by adding noise). LDP ensures more stringent user privacy protection because the curator does not have access to any of the user's original information. On the curator's side, however, the noise for privacy results in additional bias and variance in their analyses, thus it is of great interest for analysts to understand how noise impacts their analyses and to conduct informative analyses. In this article, we develop statistically valid methodologies to infer causal effects from the privatized data under the Rubin Causal Model framework. First, we present asymptotically unbiased and consistent estimators with their variance estimators and plug-in confidence intervals. Second, we develop a Bayesian nonparametric methodology, which performs well in terms of MSE for tight privacy budgets. A blocked Gibbs sampling algorithm has been developed. Finally, we present simulation studies to evaluate the performance of frequentist and Bayesian methodologies for various privacy budgets, resulting in useful suggestions for performing causal inference for differentially private data.
翻译:当地差异隐私(LDP)是一种差异性隐私模式(DP),在这种模式中,个人在应用DP机制(通常增加噪音)后将其数据传递给保管人(DP),个人可以将其数据传递给保管人(DP),LDP确保更严格地保护用户隐私,因为保管人无法获得任何用户的原始信息。然而,在保管人方面,隐私噪音导致他们的分析中出现更多的偏差和差异,因此分析员非常有兴趣了解噪音如何影响他们的分析并进行信息分析。在本条中,我们制定了具有统计效力的方法,从Rubin Causal模型框架下的私有化数据中推断出因果关系。首先,我们提出了带有差异估计器和插入信任间隔的简单、不偏不倚和一致的估算器。第二,我们开发了一种巴伊斯非对称方法,该方法在MSE对严格的隐私预算方面表现良好。一个被阻断的吉布斯抽样算法已经开发出来。最后,我们提出模拟研究,以评价经常和巴伊西亚方法在各种隐私预算中的表现,从而提出有用的建议对差异性私人数据进行因果关系推断。