With the increasingly strengthened data privacy act and the difficult data centralization, Federated Learning (FL) has become an effective solution to collaboratively train the model while preserving each client's privacy. FedAvg is a standard aggregation algorithm that makes the proportion of dataset size of each client as aggregation weight. However, it can't deal with non-independent and identically distributed (non-i.i.d) data well because of its fixed aggregation weights and the neglect of data distribution. In this paper, we propose an aggregation strategy that can effectively deal with non-i.i.d dataset, namely FedGraph, which can adjust the aggregation weights adaptively according to the training condition of local models in whole training process. The FedGraph takes three factors into account from coarse to fine: the proportion of each local dataset size, the topology factor of model graphs, and the model weights. We calculate the gravitational force between local models by transforming the local models into topology graphs. The FedGraph can explore the internal correlation between local models better through the weighted combination of the proportion each local dataset, topology structure, and model weights. The proposed FedGraph has been applied to the MICCAI Federated Tumor Segmentation Challenge 2021 (FeTS) datasets, and the validation results show that our method surpasses the previous state-of-the-art by 2.76 mean Dice Similarity Score. The source code will be available at Github.
翻译:随着数据隐私法的日益强化和数据集中化的困难程度,FedAvg(FedAvg)已成为合作培训模型并同时保护每个客户隐私的有效解决办法。FedAvg是一个标准汇总算法,它将每个客户的数据集大小比例作为总加权数。然而,它无法很好地处理非独立和同样分布的数据(非一.一.d)数据(非一.d)数据),因为其固定的聚合权重和对数据分布的忽视。在本文中,我们提出了一个可以有效处理非i.i.d数据集(即FedGraph)的汇总战略,即FedGraph(FedGraph),它可以在整个培训过程中根据当地模型的培训条件调整汇总权重。FedGraph将三个因素从粗略到细考虑:每个本地数据集大小的比例、模型图示的表因数系数和模型重量。我们通过将地方模型转换成表图表的图表图表。FedGraphrph可以通过对每个本地数据比例的加权组合来更好地探讨地方模型之间的内部关联关系。在FedFinFSreal的数值结构结构结构结构结构中,根据以前的模型和图表结构显示了我们应用的数值结构的模型的模型和图表结构显示。