Federated Learning (FL) enables the multiple participating devices to collaboratively contribute to a global neural network model while keeping the training data locally. Unlike the centralized training setting, the non-IID and imbalanced (statistical heterogeneity) training data of FL is distributed in the federated network, which will increase the divergences between the local models and global model, further degrading performance. In this paper, we propose a novel clustered federated learning (CFL) framework FedGroup, in which we 1) group the training of clients based on the similarities between the clients' optimization directions for high training performance; 2) construct a new data-driven distance measure to improve the efficiency of the client clustering procedure. 3) implement a newcomer device cold start mechanism based on the auxiliary global model for framework scalability and practicality. FedGroup can achieve improvements by dividing joint optimization into groups of sub-optimization and can be combined with FL optimizer FedProx. The convergence and complexity are analyzed to demonstrate the efficiency of our proposed framework. We also evaluate FedGroup and FedGrouProx (combined with FedProx) on several open datasets and made comparisons with related CFL frameworks. The results show that FedGroup can significantly improve absolute test accuracy by +14.1% on FEMNIST compared to FedAvg. +3.4% on Sentiment140 compared to FedProx, +6.9% on MNIST compared to FeSEM.
翻译:联邦学习联合会(FL)使多种参与设备能够协作为全球神经网络模式作出贡献,同时保持当地的培训数据。与中央培训环境不同,FL的非IID和不平衡(统计异质性)培训数据在联邦网络中分布,这将增加当地模式与全球模式之间的差异,进一步降低业绩。在本文件中,我们提议了一个新型的集群联合学习联合会框架FedGroup,其中我们1)根据客户对高培训业绩的最佳指导之间的相似之处,将客户培训分组;2)制定新的数据驱动远程措施,以提高客户群集程序的效率。3)在辅助性全球框架可扩缩性和实用性模型的基础上,实施新推出的装置冷启动机制。美化集团可以通过将联合优化分为次级优化组和全球模式,与FL优化框架FFP6FROx。我们分析了这种趋同性和复杂性,以显示我们拟议框架的效率。我们还评估了FDGGroup和FedGrou Prox(与FedProx结合),在几个开放的FDMFC+FL1的绝对性框架方面,可以比FFFFD+FMFMFMFCFC的绝对性框架。