Federated learning has emerged as a new paradigm of collaborative machine learning; however, it has also faced several challenges such as non-independent and identically distributed(IID) data and high communication cost. To this end, we propose a novel framework of IID and communication-aware group federated learning that simultaneously maximizes both accuracy and communication speed by grouping nodes based on data distributions and physical locations of the nodes. Furthermore, we provide a formal convergence analysis and an efficient optimization algorithm called FedAvg-IC. Experimental results show that, compared with the state-of-the-art algorithms, FedAvg-IC improved the test accuracy by up to 22.2% and simultaneously reduced the communication time to as small as 12%.
翻译:联邦学习已成为合作机器学习的新范例;然而,联邦学习也面临若干挑战,如非独立和同样分布的(IID)数据和高通信成本。为此,我们提议建立一个新的ID和通信意识小组联合学习框架,通过根据数据分布和节点的物理位置对节点进行分组,同时最大限度地提高准确性和通信速度。此外,我们提供了正式的趋同分析和一个称为FedAvg-IC的高效优化算法。实验结果表明,与最先进的算法相比,FedAvg-IC提高了测试精度22.2%,同时将通信时间减少到小到12%。