We propose a novel Bayesian inference framework for distributed differentially private linear regression. We consider a distributed setting where multiple parties hold parts of the data and share certain summary statistics of their portions in privacy-preserving noise. We develop a novel generative statistical model for privately shared statistics, which exploits a useful distributional relation between the summary statistics of linear regression. Bayesian estimation of the regression coefficients is conducted mainly using Markov chain Monte Carlo algorithms, while we also provide a fast version to perform Bayesian estimation in one iteration. The proposed methods have computational advantages over their competitors. We provide numerical results on both real and simulated data, which demonstrate that the proposed algorithms provide well-rounded estimation and prediction.
翻译:我们为分布式的私人线性回归提出了一个新颖的贝叶西亚推论框架。我们考虑一个分布式环境,让多方持有部分数据并分享各自部分的某些简要统计,以保持隐私的噪音。我们为私人共享统计开发了一个新型的基因化统计模型,利用线性回归简要统计之间的有益分配关系。贝叶西亚对回归系数的估计主要使用Markov链 Monte Carlo算法进行,同时我们还提供一个快速的版本,在一个迭代中进行巴伊西亚估算。拟议方法在计算上优于其竞争对手。我们提供了真实和模拟数据的数字结果,表明拟议的算法提供了周密的估计和预测。