For mitigating Byzantine behaviors in federated learning (FL), most state-of-the-art approaches, such as Bulyan, tend to leverage the similarity of updates from the benign clients. However, in many practical FL scenarios, data is non-IID across clients, thus the updates received from even the benign clients are quite dissimilar. Hence, using similarity based methods result in wasted opportunities to train a model from interesting non-IID data, and also slower model convergence. We propose DiverseFL to overcome this challenge in heterogeneous data distribution settings. Rather than comparing each client's update with other client updates to detect Byzantine clients, DiverseFL compares each client's update with a guiding update of that client. Any client whose update diverges from its associated guiding update is then tagged as a Byzantine node. The FL server in DiverseFL computes the guiding update in every round for each client over a small sample of the client's local data that is received only once before start of the training. However, sharing even a small sample of client's data with the FL server can compromise client's data privacy needs. To tackle this challenge, DiverseFL creates a Trusted Execution Environment (TEE)-based enclave to receive each client's sample and to compute its guiding updates. TEE provides a hardware assisted verification and attestation to each client that its data is not leaked outside of TEE. Through experiments involving neural networks, benchmark datasets and popular Byzantine attacks, we demonstrate that DiverseFL not only performs Byzantine mitigation quite effectively, it also almost matches the performance of OracleSGD, where the server only aggregates the updates from the benign clients.
翻译:为了减少联盟学习中的拜占庭行为,大多数最先进的方法,如Bulyan, 都倾向于利用来自友好客户的最新消息的相似性。然而,在许多实用的FL假设中,数据是客户之间非IID的数据,因此,即使来自友好客户的最新消息也大相径庭。因此,基于相似性的方法导致浪费了从有趣的非IID数据中培训模型的机会,也降低了模型的趋同速度。我们建议多样性FlFL在数据分配设置上克服这一挑战。我们建议将每个客户的最新消息与其他客户的更新数据进行比较,以便有效地检测Byantine客户的客户最新消息,从而有效地检测Byartine客户的最新消息。在很多实际的FLFL假设中,每个客户的最新消息与相关指导更新的Byzantine值不相悖,因此,在VIFLFL服务器的每轮更新中,每个客户的指南更新只是从一个小样本中收到的本地数据,在培训开始前只有一次显示。然而,我们与FLSeral服务器的少量客户最新数据样本, 而不是通过每个客户的服务器更新的SildFER数据库, 需要一个稳定客户的最新数据。