Federated Learning (FL) is an efficient distributed machine learning paradigm that employs private datasets in a privacy-preserving manner. The main challenges of FL is that end devices usually possess various computation and communication capabilities and their training data are not independent and identically distributed (non-IID). Due to limited communication bandwidth and unstable availability of such devices in a mobile network, only a fraction of end devices (also referred to as the participants or clients in a FL process) can be selected in each round. Hence, it is of paramount importance to utilize an efficient participant selection scheme to maximize the performance of FL including final model accuracy and training time. In this paper, we provide a review of participant selection techniques for FL. First, we introduce FL and highlight the main challenges during participant selection. Then, we review the existing studies and categorize them based on their solutions. Finally, we provide some future directions on participant selection for FL based on our analysis of the state-of-the-art in this topic area.
翻译:联邦学习组织(FL)是一个高效分布式的机器学习模式,以保持隐私的方式使用私人数据集;FL的主要挑战在于终端设备通常拥有各种计算和通信能力,其培训数据不是独立和完全分布的(非IID)。由于通信带宽有限,移动网络中此类设备的可用性不稳定,只能在每个回合中选择一小部分终端设备(也称为FL进程中的参与者或客户),因此,至关重要的是,利用高效的参与者选择计划最大限度地发挥FL的性能,包括最终模型准确性和培训时间。我们在本文件中审查了FL的参与者选择技术。首先,我们介绍了FL,并突出强调了参与者选择过程中的主要挑战。然后,我们根据现有研究并根据这些研究的解决方案对其进行分类。最后,我们根据我们对这个主题领域的最新技术的分析,为FL的参与者选择提供了一些未来方向。