Collaborative deep reinforcement learning (CDRL) algorithms in which multiple agents can coordinate over a wireless network is a promising approach to enable future intelligent and autonomous systems that rely on real-time decision-making in complex dynamic environments. Nonetheless, in practical scenarios, CDRL faces many challenges due to the heterogeneity of agents and their learning tasks, different environments, time constraints of the learning, and resource limitations of wireless networks. To address these challenges, in this paper, a novel semantic-aware CDRL method is proposed to enable a group of heterogeneous untrained agents with semantically-linked DRL tasks to collaborate efficiently across a resource-constrained wireless cellular network. To this end, a new heterogeneous federated DRL (HFDRL) algorithm is proposed to select the best subset of semantically relevant DRL agents for collaboration. The proposed approach then jointly optimizes the training loss and wireless bandwidth allocation for the cooperating selected agents in order to train each agent within the time limit of its real-time task. Simulation results show the superior performance of the proposed algorithm compared to state-of-the-art baselines.
翻译:为了应对这些挑战,本文件提议了一种新型的语义识别法方法,使一组具有与语义相关的DRL任务、未经训练的混合物剂能够在一个受资源限制的无线手机网络之间有效协作。为此,提议采用一种新的混合配制DRL(HFDL)算法,为协作选择最精细的语义相关DRL(HFDL)代理。拟议的方法随后共同优化选定的合作物剂的培训损失和无线带宽分配,以便在实时任务时限内对每个物剂进行培训。模拟结果显示拟议的算法相对于州际基线的优异性表现。