Federated learning (FL), an attractive and promising distributed machine learning paradigm, has sparked extensive interest in exploiting tremendous data stored on ubiquitous mobile devices. However, conventional FL suffers severely from resource heterogeneity, as clients with weak computational and communication capability may be unable to complete local training using the same local training hyper-parameters. In this paper, we propose Dap-FL, a deep deterministic policy gradient (DDPG)-assisted adaptive FL system, in which local learning rates and local training epochs are adaptively adjusted by all resource-heterogeneous clients through locally deployed DDPG-assisted adaptive hyper-parameter selection schemes. Particularly, the rationality of the proposed hyper-parameter selection scheme is confirmed through rigorous mathematical proof. Besides, due to the thoughtlessness of security consideration of adaptive FL systems in previous studies, we introduce the Paillier cryptosystem to aggregate local models in a secure and privacy-preserving manner. Rigorous analyses show that the proposed Dap-FL system could guarantee the security of clients' private local models against chosen-plaintext attacks and chosen-message attacks in a widely used honest-but-curious participants and active adversaries security model. In addition, through ingenious and extensive experiments, the proposed Dap-FL achieves higher global model prediction accuracy and faster convergence rates than conventional FL, and the comprehensiveness of the adjusted local training hyper-parameters is validated. More importantly, experimental results also show that the proposed Dap-FL achieves higher model prediction accuracy than two state-of-the-art RL-assisted FL methods, i.e., 6.03% higher than DDPG-based FL and 7.85% higher than DQN-based FL.
翻译:联邦学习(FL)是一个有吸引力和有希望的分布式机器学习模式,它激发了人们对利用在无处不在的移动设备上储存的巨大数据的广泛兴趣,然而,常规FL却因资源差异性而深受资源差异化的困扰,因为计算和通信能力薄弱的客户可能无法使用相同的本地培训超参数完成本地培训。在本文中,我们提议Dap-FL(一种由极低确定性的政策梯度(DDPG)辅助的适应性FL(DPG)系统),在这个系统中,当地学习率和当地培训速度都由所有资源过多的客户通过当地部署的DDPG(DDP)辅助的适应性适应性全面超参数选择计划进行适应调整。特别是,拟议的超标准计算和通信能力强的超标准选择选择选择系统是否合理,此外,由于在以往的研究中对适应性FL(DL)系统进行安全考虑的考虑不够周全,我们采用基于安全和隐私的模型的快速模式(DPL)系统,拟议的Dap-FL系统可以保证客户的私人模型安全性更安全性、更精确的精确的准确性攻击和选择性(FL-L-L-L)和广泛实验参与者采用两种方法。