Secure aggregation is a critical component in federated learning, which enables the server to learn the aggregate model of the users without observing their local models. Conventionally, secure aggregation algorithms focus only on ensuring the privacy of individual users in a single training round. We contend that such designs can lead to significant privacy leakages over multiple training rounds, due to partial user selection/participation at each round of federated learning. In fact, we empirically show that the conventional random user selection strategies for federated learning lead to leaking users' individual models within number of rounds linear in the number of users. To address this challenge, we introduce a secure aggregation framework with multi-round privacy guarantees. In particular, we introduce a new metric to quantify the privacy guarantees of federated learning over multiple training rounds, and develop a structured user selection strategy that guarantees the long-term privacy of each user (over any number of training rounds). Our framework also carefully accounts for the fairness and the average number of participating users at each round. We perform several experiments on MNIST and CIFAR-10 datasets in the IID and the non-IID settings to demonstrate the performance improvement over the baseline algorithms, both in terms of privacy protection and test accuracy.
翻译:安全聚合是联合学习的重要组成部分,使服务器能够在不观察当地模式的情况下学习用户的综合模型。 通常,安全的聚合算法仅侧重于确保单个用户在单轮培训中的隐私。 我们坚持认为,由于部分用户选择/参与每轮联合学习,这种设计可能导致多个培训回合的重大隐私泄漏。事实上,我们从经验上表明,联邦学习的传统随机用户选择战略导致用户个人模型在用户数的直线回合数中泄漏。为了应对这一挑战,我们引入了一个具有多轮隐私保障的安全集合框架。特别是,我们引入了一种新的衡量标准,量化在多轮培训中联合学习的隐私保障,并制定一个结构化的用户选择战略,保障每个用户的长期隐私(超过任何几轮培训回合)。我们的框架还仔细说明每轮参与用户的公平性和平均人数。我们在ID和非IID环境中对MNIST和CIFAR-10数据集进行了几次实验,以显示在基线保护和非IID的精确度测试方面,在基线和精确度上的业绩改进。