Federated learning (FL) is an emerging distributed machine learning paradigm that protects privacy and tackles the problem of isolated data islands. At present, there are two main communication strategies of FL: synchronous FL and asynchronous FL. The advantages of synchronous FL are that the model has high precision and fast convergence speed. However, this synchronous communication strategy has the risk that the central server waits too long for the devices, namely, the straggler effect which has a negative impact on some time-critical applications. Asynchronous FL has a natural advantage in mitigating the straggler effect, but there are threats of model quality degradation and server crash. Therefore, we combine the advantages of these two strategies to propose a clustered semi-asynchronous federated learning (CSAFL) framework. We evaluate CSAFL based on four imbalanced federated datasets in a non-IID setting and compare CSAFL to the baseline methods. The experimental results show that CSAFL significantly improves test accuracy by more than +5% on the four datasets compared to TA-FedAvg. In particular, CSAFL improves absolute test accuracy by +34.4% on non-IID FEMNIST compared to TA-FedAvg.
翻译:联邦学习(FL)是一个新兴的分布式机器学习模式,它保护隐私,解决孤立的数据岛屿问题。目前,FL有两个主要通信战略:同步的FL和不同步的FL。同步的FL的优点在于该模型具有高度精密和快速的趋同速度。然而,这种同步的通信战略有可能使中央服务器对设备等待太久,即对一些时间紧迫的应用产生消极影响的挤压效应。Asynchronous FL在减轻挤压效应方面有自然优势,但存在模型质量退化和服务器崩溃的威胁。因此,我们结合这两种战略的优势,提出一个组合的半自动断裂式联合学习(CSAFL)框架。我们根据非IID设置中的四种不平衡的 federerate数据集对CSAFL进行了评估,并将CSAFL与基准方法进行比较。实验结果表明,CSAFL在四种数据设置上,比TA-FA+FA的绝对精确度,比TA-FA 具体地改进了C-FA。