联邦不学习 (Federated Unlearning)

Federated learning (FL) has recently emerged as a promising distributed machine learning (ML) paradigm. Practical needs of the "right to be forgotten" and countering data poisoning attacks call for efficient techniques that can remove, or unlearn, specific training data from the trained FL model. Existing unlearning techniques in the context of ML, however, are no longer in effect for FL, mainly due to the inherent distinction in the way how FL and ML learn from data. Therefore, how to enable efficient data removal from FL models remains largely under-explored. In this paper, we take the first step to fill this gap by presenting FedEraser, the first federated unlearning methodology that can eliminate the influence of a federated client's data on the global FL model while significantly reducing the time used for constructing the unlearned FL model.The basic idea of FedEraser is to trade the central server's storage for unlearned model's construction time, where FedEraser reconstructs the unlearned model by leveraging the historical parameter updates of federated clients that have been retained at the central server during the training process of FL. A novel calibration method is further developed to calibrate the retained updates, which are further used to promptly construct the unlearned model, yielding a significant speed-up to the reconstruction of the unlearned model while maintaining the model efficacy. Experiments on four realistic datasets demonstrate the effectiveness of FedEraser, with an expected speed-up of $4\times$ compared with retraining from the scratch. We envision our work as an early step in FL towards compliance with legal and ethical criteria in a fair and transparent manner.

翻译：联邦学习(FL)最近已成为一个有希望的分布式机器学习(ML)范例。“被遗忘的权利”和打击数据中毒袭击的实际需要要求采用高效技术,从训练有素的FL模型中去除或删除具体的培训数据。但是,在ML模型中现有的未学习技术对FL不再有效,这主要是因为FL和ML从数据中学习的方式存在内在的区别。因此,如何使从FL模型中有效提取数据的速度仍然基本不足。在本文中,我们迈出了填补这一差距的第一步,我们展示了Federoraser(FedEraser),这是第一个能消除全球FL模型中联合法律客户数据的影响的不学习方法,同时大大缩短了用于构建不吸取FL模型模型的时间。 FedEraster的基本想法是将中央服务器储存的未经学习模型模型的模型转换到模型的模型的建造时间,而FErasererererr则利用在核心服务器中保存的未更新的历史参数来填补这一空白,在更新过程中,在升级的FL模型的升级过程中,在不断校正一个前的进度中,在不断校正一个不断校正。

相关内容

MoDELS

关注 43

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/