CSAFL: 一组半同步的半同步联邦学习框架 (CSAFL: A Clustered Semi-Asynchronous Federated Learning Framework)

Federated learning (FL) is an emerging distributed machine learning paradigm that protects privacy and tackles the problem of isolated data islands. At present, there are two main communication strategies of FL: synchronous FL and asynchronous FL. The advantages of synchronous FL are that the model has high precision and fast convergence speed. However, this synchronous communication strategy has the risk that the central server waits too long for the devices, namely, the straggler effect which has a negative impact on some time-critical applications. Asynchronous FL has a natural advantage in mitigating the straggler effect, but there are threats of model quality degradation and server crash. Therefore, we combine the advantages of these two strategies to propose a clustered semi-asynchronous federated learning (CSAFL) framework. We evaluate CSAFL based on four imbalanced federated datasets in a non-IID setting and compare CSAFL to the baseline methods. The experimental results show that CSAFL significantly improves test accuracy by more than +5% on the four datasets compared to TA-FedAvg. In particular, CSAFL improves absolute test accuracy by +34.4% on non-IID FEMNIST compared to TA-FedAvg.

翻译：联邦学习(FL)是一个新兴的分布式机器学习模式,它保护隐私,解决孤立的数据岛屿问题。目前,FL有两个主要通信战略:同步的FL和不同步的FL。同步的FL的优点在于该模型具有高度精密和快速的趋同速度。然而,这种同步的通信战略有可能使中央服务器对设备等待太久,即对一些时间紧迫的应用产生消极影响的挤压效应。Asynchronous FL在减轻挤压效应方面有自然优势,但存在模型质量退化和服务器崩溃的威胁。因此,我们结合这两种战略的优势,提出一个组合的半自动断裂式联合学习(CSAFL)框架。我们根据非IID设置中的四种不平衡的 federerate数据集对CSAFL进行了评估,并将CSAFL与基准方法进行比较。实验结果表明,CSAFL在四种数据设置上,比TA-FA+FA的绝对精确度,比TA-FA 具体地改进了C-FA。

相关内容

联邦学习

关注 200

联邦学习（Federated Learning）是一种新兴的人工智能基础技术，在 2016 年由谷歌最先提出，原本用于解决安卓手机终端用户在本地更新模型的问题，其设计目标是在保障大数据交换时的信息安全、保护终端数据和个人数据隐私、保证合法合规的前提下，在多参与方或多计算结点之间开展高效率的机器学习。其中，联邦学习可使用的机器学习算法不局限于神经网络，还包括随机森林等重要算法。联邦学习有望成为下一代人工智能协同算法和协作网络的基础。