Personalized federated learning allows clients in a distributed system to train a neural network tailored to their unique local data while leveraging information from other clients. However, clients' models are vulnerable to attacks during both the training and testing phases. In this paper, we address the issue of adversarial clients crafting evasion attacks at test time to deceive other clients. For example, adversaries may seek monetary gain by deceiving spam filters or recommendation systems trained with personalized federated learning. The adversarial clients have varying degrees of personalization depending on the distributed learning method, leading to a "grey-box" situation. We are the first to characterize the transferability of such internal evasion attacks for different learning methods and to analyze the trade-off between model accuracy and robustness as a function of the degree of personalization and the similarities in client data. We introduce a defense mechanism, pFedDef, that performs personalized federated adversarial training while respecting the resource limitations at clients that inhibit adversarial training. Overall, pFedDef increases relative grey-box adversarial robustness by 62% compared to federated adversarial training and performs well even under limited system resources.
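To make the defense concrete for readers unfamiliar with adversarial training, the sketch below shows a generic local update of the kind pFedDef builds on: a PGD inner maximization followed by a training step on a mix of clean and adversarial examples. This is a minimal illustration under assumed conventions, not the paper's implementation; the names `pgd_attack` and `local_adversarial_step`, and the `adv_fraction` knob for modeling client resource limits, are hypothetical.

```python
import torch
import torch.nn.functional as F

def pgd_attack(model, x, y, eps=0.1, alpha=0.01, steps=10):
    """Craft L-infinity PGD adversarial examples (inner maximization).

    Hypothetical sketch: starts from a random point in the eps-ball
    and takes signed gradient ascent steps on the loss.
    """
    x_adv = x.detach() + torch.empty_like(x).uniform_(-eps, eps)
    for _ in range(steps):
        x_adv.requires_grad_(True)
        loss = F.cross_entropy(model(x_adv), y)
        grad = torch.autograd.grad(loss, x_adv)[0]
        x_adv = x_adv.detach() + alpha * grad.sign()
        # Project back into the eps-ball around the clean input.
        x_adv = x.detach() + (x_adv - x).clamp(-eps, eps)
    return x_adv.detach()

def local_adversarial_step(model, optimizer, x, y, adv_fraction=0.5):
    """One local training step mixing clean and adversarial examples.

    adv_fraction is a hypothetical resource knob: a client with a
    tight compute budget can lower it to craft fewer PGD examples.
    """
    n_adv = int(adv_fraction * x.size(0))
    x_adv = pgd_attack(model, x[:n_adv], y[:n_adv])
    batch = torch.cat([x_adv, x[n_adv:]], dim=0)
    optimizer.zero_grad()
    loss = F.cross_entropy(model(batch), y)
    loss.backward()
    optimizer.step()
    return loss.item()
```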