Federated learning (FL) is vulnerable to model poisoning attacks, in which malicious clients corrupt the global model via sending manipulated model updates to the server. Existing defenses mainly rely on Byzantine-robust FL methods, which aim to learn an accurate global model even if some clients are malicious. However, they can only resist a small number of malicious clients in practice. It is still an open challenge how to defend against model poisoning attacks with a large number of malicious clients. Our FLDetector addresses this challenge via detecting malicious clients. FLDetector aims to detect and remove the majority of the malicious clients such that a Byzantine-robust FL method can learn an accurate global model using the remaining clients. Our key observation is that, in model poisoning attacks, the model updates from a client in multiple iterations are inconsistent. Therefore, FLDetector detects malicious clients via checking their model-updates consistency. Roughly speaking, the server predicts a client's model update in each iteration based on its historical model updates using the Cauchy mean value theorem and L-BFGS, and flags a client as malicious if the received model update from the client and the predicted model update are inconsistent in multiple iterations. Our extensive experiments on three benchmark datasets show that FLDetector can accurately detect malicious clients in multiple state-of-the-art model poisoning attacks. After removing the detected malicious clients, existing Byzantine-robust FL methods can learn accurate global models.Our code is available at https://github.com/zaixizhang/FLDetector.
翻译:联邦学习组织(FL)很容易受到模式中毒袭击,其中恶意客户通过向服务器发送经过操纵的模型更新而腐蚀全球模型;现有的防御主要依靠Byzantine-robust FL方法,这些方法旨在学习准确的全球模型,即使有些客户是恶意的;然而,他们实际上只能抵制少数恶意客户;它仍然是一个公开的挑战,如何用大量恶意客户来防御模式中毒袭击;我们的FL探测器通过探测恶意客户来应对这一挑战;FL探测器的目的是检测和清除大多数恶意客户,例如Byzantine-robust FL方法能够利用剩余客户学习准确的全球模型。我们的主要观察是,在模式中毒袭击中,客户的模型更新不一致。因此,FL探测器通过检查模型更新的更多信息,检测恶意客户的恶意。粗略地说,服务器根据历史模型更新,利用Causurving 表示的温度和L-BIGS-FL方法,将一个客户的准确的模型作为恶意的客户。我们从模型/Florder测试中不断更新。