We investigate a data quality-aware dynamic client selection problem for multiple federated learning (FL) services in a wireless network, where each client offers dynamic datasets for the simultaneous training of multiple FL services, and each FL service demander has to pay for the clients under constrained monetary budgets. The problem is formalized as a non-cooperative Markov game over the training rounds. A multi-agent hybrid deep reinforcement learning-based algorithm is proposed to optimize the joint client selection and payment actions, while avoiding action conflicts. Simulation results indicate that our proposed algorithm can significantly improve training performance.
翻译:我们调查一个无线网络中多个联合学习(FL)服务的数据质量动态客户选择问题,每个客户都提供动态数据集,用于同时培训多个FL服务,每个FL服务需求者必须在受限制的货币预算下为客户付费。 这个问题在培训回合中被正式确定为不合作的Markov游戏。 提议采用多剂混合强化深层学习算法优化联合客户选择和支付行动,同时避免行动冲突。 模拟结果表明,我们提议的算法可以大大改善培训业绩。