Federated learning enables multiple participants to collaboratively train a model without aggregating their training data. Although the training data are kept within each participant and the local gradients can be securely aggregated, recent studies have shown that such privacy protection is insufficient: the global model parameters that must be shared for optimization can leak information about the training data. In this work, we propose Confined Gradient Descent (CGD), which enhances the privacy of federated learning by eliminating the sharing of global model parameters. CGD exploits the fact that gradient descent optimization can start from a set of discrete points and converge to another set in the neighborhood of the global minimum of the objective function. It lets the participants train independently on their local data and securely share the sum of their local gradients to benefit each other. We formally demonstrate CGD's privacy enhancement over traditional FL: we prove that less information is exposed in CGD than in traditional FL. CGD also guarantees the desired model accuracy. We theoretically establish a convergence rate for CGD, and prove that the loss of each participant's proprietary model against a model trained on the aggregated training data is bounded. Extensive experiments on two real-world datasets show that CGD's performance is comparable to that of centralized learning, with marginal differences in validation loss (mostly within 0.05) and accuracy (mostly within 1%).
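To make the described update concrete, the following is a minimal toy sketch of the CGD idea under our own assumptions: each participant keeps a confined model started from a distinct random point, computes a gradient on its own data at its own model, and every model steps with the sum of all local gradients. The quadratic loss, the function names, and the plain-sum stand-in for secure aggregation are illustrative assumptions, not the paper's implementation.

```python
import numpy as np

def local_gradient(w, X, y):
    # Gradient of the mean squared error 0.5 * ||X w - y||^2 / n on local data.
    return X.T @ (X @ w - y) / len(y)

def cgd_round(models, datasets, lr=0.1):
    # Each participant evaluates a gradient of its own confined model on its
    # own data; only the sum of the gradients is shared (here a plain sum,
    # standing in for a secure-aggregation protocol).
    grads = [local_gradient(w, X, y) for w, (X, y) in zip(models, datasets)]
    g_sum = sum(grads)
    # Every confined model takes the same step; no global parameters are shared.
    return [w - lr * g_sum for w in models]

rng = np.random.default_rng(0)
d, participants = 5, 3
datasets = [(rng.normal(size=(50, d)), rng.normal(size=50))
            for _ in range(participants)]
# Distinct random starting points: the "set of discrete points" the abstract
# mentions, which converges to a set near the global minimum.
models = [rng.normal(size=d) for _ in range(participants)]

for _ in range(200):
    models = cgd_round(models, datasets)
```

In this sketch the pairwise offsets between the confined models are preserved by the shared step, so the models converge as a set to a neighborhood of the global minimum of the summed objective rather than to a single shared parameter vector.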