Proactive edge association is capable of improving wireless connectivity at the cost of increased handover (HO) frequency and energy consumption, while relying on a large amount of private information sharing required for decision making. In order to improve the connectivity-cost trade-off without privacy leakage, we investigate the privacy-preserving joint edge association and power allocation (JEAPA) problem in the face of the environmental uncertainty and the infeasibility of individual learning. Upon modelling the problem by a decentralized partially observable Markov Decision Process (Dec-POMDP), it is solved by federated multi-agent reinforcement learning (FMARL) through only sharing encrypted training data for federatively learning the policy sought. Our simulation results show that the proposed solution strikes a compelling trade-off, while preserving a higher privacy level than the state-of-the-art solutions.
翻译:积极边缘协会能够改善无线连通,其代价是增加移交(HO)频率和能源消耗,同时依赖大量决策所需的私人信息共享。为了改善连通成本的权衡,不泄露隐私,我们调查在面对环境不确定性和个人学习不可行的情况下,隐私保护联合边缘协会和电力分配问题。通过分散化部分可见的马尔科夫决策程序(Dec-POMDP)模拟这一问题,通过联合多剂强化学习(FMARL),仅共享加密培训数据,用于联合学习所寻求的政策来解决。我们的模拟结果表明,拟议解决方案在保持比最新解决方案更高的隐私水平的同时,取得了令人信服的平衡。