In federated learning (FL), model training is distributed over clients and local models are aggregated by a central server. The performance of uploaded models in such situations can vary widely due to imbalanced data distributions, potential demands on privacy protections, and quality of transmissions. In this paper, we aim to minimize FL training delay over wireless channels, constrained by overall training performance as well as each client's differential privacy (DP) requirement. We solve this problem in the framework of multi-agent multi-armed bandit (MAMAB) to deal with the situation where there are multiple clients confornting different unknown transmission environments, e.g., channel fading and interferences. Specifically, we first transform the long-term constraints on both training performance and each client's DP into a virtual queue based on the Lyapunov drift technique. Then, we convert the MAMAB to a max-min bipartite matching problem at each communication round, by estimating rewards with the upper confidence bound (UCB) approach. More importantly, we propose two efficient solutions to this matching problem, i.e., modified Hungarian algorithm and greedy matching with a better alternative (GMBA), in which the first one can achieve the optimal solution with a high complexity while the second one approaches a better trade-off by enabling a verified low-complexity with little performance loss. In addition, we develop an upper bound on the expected regret of this MAMAB based FL framework, which shows a linear growth over the logarithm of communication rounds, justifying its theoretical feasibility. Extensive experimental results are conducted to validate the effectiveness of our proposed algorithms, and the impacts of various parameters on the FL performance over wireless edge networks are also discussed.
翻译:在联合学习(FL)中,模型培训分布在客户之间,当地模型由中央服务器汇总;由于数据分布不平衡、对隐私保护的潜在要求以及传输质量等原因,上传模型在此种情况下的性能可能大不相同。在本文中,我们的目标是尽量减少无线频道的FL培训延迟,这受总体培训业绩以及每个客户的不同隐私(DP)要求的限制。我们在多试多臂多臂土匪(MAMAB)的框架内解决这个问题,以应对存在不同未知传输环境的多个客户,例如频道淡化和干扰。具体地说,我们首先将培训业绩和每个客户的DP的长期限制转化为基于Lyapunov漂移技术的虚拟排队。然后,我们把MAMAB转换成每轮通信的最大双向匹配问题,用上层信任(UCB)方法估算收益。更重要的是,我们提出两个高效的匹配问题解决方案,即:修改匈牙利的算法和贪婪,同时用一种更佳的法程,同时用一种最优的法程方法,我们用一种最优的通货路进行。