保密和过失容忍分散化学习 (Secure and Fault Tolerant Decentralized Learning)

Federated learning (FL) is a promising paradigm for training a global model over data distributed across multiple data owners without centralizing clients' raw data. However, sharing of local model updates can also reveal information of clients' local datasets. Trusted execution environments (TEEs) within the FL server have been recently deployed by companies like Meta for secure aggregation. However, secure aggregation can suffer from error-prone local updates sent by clients that become faulty during training due to underlying device malfunctions. Also, data heterogeneity across clients makes fault mitigation challenging, as even updates from normal clients are dissimilar. Thus, most of the prior fault tolerant methods, which treat any local update differing from the majority of other updates as faulty, perform poorly. We propose DiverseFL to make model aggregation secure as well as robust to faults. In DiverseFL, any client whose local model update diverges from its associated guiding update is tagged as being faulty. To implement our novel per-client criteria for fault mitigation, DiverseFL creates a TEE-based secure enclave within the FL server, which in addition to performing secure aggregation for carrying out the global model update step, securely receives a small representative sample of local data from each client only once before training, and computes guiding updates for each participating client during training. Thus, DiverseFL provides security against privacy leakage as well as robustness against faulty clients. In experiments, DiverseFL consistently achieves significant improvements in absolute test accuracy over prior fault mitigation benchmarks. DiverseFL also performs closely to OracleSGD, where server combines updates only from the normal clients. We also analyze the convergence rate of DiverseFL under non-IID data and standard convexity assumptions.

翻译：联邦学习(FL)是针对多个数据所有者在不集中客户原始数据的情况下传播的数据进行全球模型培训的一个很有希望的范例。然而,分享本地模型更新也能够揭示客户本地数据集的信息。FL服务器中信任的执行环境最近被Meta等公司部署用于安全汇总。然而,客户在培训期间发送的本地最新信息如果因设备故障而出现错误,则可能因客户在培训过程中出现错误而出现错误而出现故障。此外,客户之间的数据差异性差异性使得减少错误的绝对准确性变得困难,因为正常客户的更新情况也不同。因此,大部分以往的不易出错容忍方法(这些方法把与大多数其他更新不同的本地更新视为错误)也能揭示客户本地数据集的信息。我们建议,FLFL服务器中的本地模型更新与相关隐私更新发生错误性差异性差性(Everflllll),在每次测试客户的升级之前,除了进行不安全的不易变易变易变易变易变易变,在每部客户的升级之前,还进行不易变易变易变易变的升级。