Federated learning (FL) has recently emerged as a promising distributed machine learning (ML) paradigm. Practical needs of the "right to be forgotten" and countering data poisoning attacks call for efficient techniques that can remove, or unlearn, specific training data from the trained FL model. Existing unlearning techniques in the context of ML, however, are no longer in effect for FL, mainly due to the inherent distinction in the way how FL and ML learn from data. Therefore, how to enable efficient data removal from FL models remains largely under-explored. In this paper, we take the first step to fill this gap by presenting FedEraser, the first federated unlearning methodology that can eliminate the influence of a federated client's data on the global FL model while significantly reducing the time used for constructing the unlearned FL model.The basic idea of FedEraser is to trade the central server's storage for unlearned model's construction time, where FedEraser reconstructs the unlearned model by leveraging the historical parameter updates of federated clients that have been retained at the central server during the training process of FL. A novel calibration method is further developed to calibrate the retained updates, which are further used to promptly construct the unlearned model, yielding a significant speed-up to the reconstruction of the unlearned model while maintaining the model efficacy. Experiments on four realistic datasets demonstrate the effectiveness of FedEraser, with an expected speed-up of $4\times$ compared with retraining from the scratch. We envision our work as an early step in FL towards compliance with legal and ethical criteria in a fair and transparent manner.
翻译:联邦学习(FL)最近已成为一个有希望的分布式机器学习(ML)范例。“被遗忘的权利”和打击数据中毒袭击的实际需要要求采用高效技术,从训练有素的FL模型中去除或删除具体的培训数据。但是,在ML模型中现有的未学习技术对FL不再有效,这主要是因为FL和ML从数据中学习的方式存在内在的区别。因此,如何使从FL模型中有效提取数据的速度仍然基本不足。在本文中,我们迈出了填补这一差距的第一步,我们展示了Federoraser(FedEraser),这是第一个能消除全球FL模型中联合法律客户数据的影响的不学习方法,同时大大缩短了用于构建不吸取FL模型模型的时间。 FedEraster的基本想法是将中央服务器储存的未经学习模型模型的模型转换到模型的模型的建造时间,而FErasererererr则利用在核心服务器中保存的未更新的历史参数来填补这一空白,在更新过程中,在升级的FL模型的升级过程中,在不断校正一个前的进度中,在不断校正一个不断校正。