Securing Secure Aggregation: Mitigating Multi-Round Privacy Leakage in Federated Learning

Secure aggregation is a critical component of federated learning: it enables the server to learn the aggregate model of the users without observing their local models. Conventionally, secure aggregation algorithms focus only on ensuring the privacy of individual users in a single training round. We contend that such designs can lead to significant privacy leakage over multiple training rounds, due to partial user selection/participation at each round of federated learning. In fact, we empirically show that conventional random user selection strategies for federated learning leak users' individual models within a number of rounds that is linear in the number of users. To address this challenge, we introduce a secure aggregation framework with multi-round privacy guarantees. In particular, we introduce a new metric to quantify the privacy guarantees of federated learning over multiple training rounds, and develop a structured user selection strategy that guarantees the long-term privacy of each user (over any number of training rounds). Our framework also carefully accounts for fairness and for the average number of participating users at each round. We perform several experiments on the MNIST and CIFAR-10 datasets in the IID and non-IID settings to demonstrate the performance improvement over the baseline algorithms, both in terms of privacy protection and test accuracy.
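As a toy illustration of why partial participation can leak individual models over many rounds, the sketch below makes a simplifying assumption that is not stated in the abstract: each user's model stays fixed across rounds, so the server sees only the sum over the users selected in each round. Under that assumption, a user is exposed as soon as the indicator vector for that user lies in the row space of the 0/1 participation matrix, because the server can then linearly combine the aggregates to isolate that user's model. The function names, the batch-based selection rule, and all parameters are hypothetical choices made for illustration; they are not the paper's algorithm or its privacy metric.

```python
from fractions import Fraction
import random


def rref(rows):
    """Return the nonzero rows of the reduced row echelon form (exact arithmetic)."""
    m = [[Fraction(x) for x in row] for row in rows]
    n_cols = len(m[0]) if m else 0
    pivot_row = 0
    for col in range(n_cols):
        if pivot_row == len(m):
            break
        # Find a row at or below pivot_row with a nonzero entry in this column.
        src = next((r for r in range(pivot_row, len(m)) if m[r][col] != 0), None)
        if src is None:
            continue
        m[pivot_row], m[src] = m[src], m[pivot_row]
        pivot = m[pivot_row][col]
        m[pivot_row] = [x / pivot for x in m[pivot_row]]
        # Clear this column in every other row.
        for r in range(len(m)):
            if r != pivot_row and m[r][col] != 0:
                factor = m[r][col]
                m[r] = [a - factor * b for a, b in zip(m[r], m[pivot_row])]
        pivot_row += 1
    return m[:pivot_row]


def exposed_users(participation):
    """Users whose model could be isolated from the per-round aggregates
    (toy assumption: models do not change between rounds)."""
    exposed = set()
    for row in rref(participation):
        nonzero = [j for j, x in enumerate(row) if x != 0]
        if len(nonzero) == 1:  # this row is a unit vector
            exposed.add(nonzero[0])
    return exposed


def random_selection(num_users, per_round, rounds, rng):
    """Baseline: pick per_round users uniformly at random in every round."""
    rows = []
    for _ in range(rounds):
        chosen = set(rng.sample(range(num_users), per_round))
        rows.append([1 if u in chosen else 0 for u in range(num_users)])
    return rows


def batch_selection(num_users, batch_size, batches_per_round, rounds, rng):
    """Hypothetical structured rule: users are split into fixed batches and
    a round always selects whole batches, never a part of one."""
    if num_users % batch_size != 0:
        raise ValueError("num_users must be a multiple of batch_size")
    batches = [range(s, s + batch_size) for s in range(0, num_users, batch_size)]
    rows = []
    for _ in range(rounds):
        row = [0] * num_users
        for batch in rng.sample(batches, batches_per_round):
            for u in batch:
                row[u] = 1
        rows.append(row)
    return rows


if __name__ == "__main__":
    rng = random.Random(0)
    n = 12
    print("random:", sorted(exposed_users(random_selection(n, 6, 40, rng))))
    print("batch: ", sorted(exposed_users(batch_selection(n, 3, 2, 40, rng))))
```

With whole-batch selection and a batch size of at least two, every row of the participation matrix is constant within each batch, so no linear combination of rounds can single out one user in this toy model. Random selection carries no such structural guarantee.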



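Federated Learning

Federated learning is an emerging foundational AI technology, first proposed by Google in 2016, originally to address the problem of updating models locally on Android phones. Its design goal is to carry out efficient machine learning across multiple participants or compute nodes while safeguarding information security during large-scale data exchange, protecting device data and personal privacy, and ensuring legal and regulatory compliance. The machine learning algorithms that can be used in federated learning are not limited to neural networks; they also include other important algorithms such as random forests. Federated learning is expected to become the foundation for the next generation of collaborative AI algorithms and cooperative networks.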
