Using Machine Learning (ML) techniques for the next generation wireless networks have shown promising results in the recent years, due to high learning and adaptation capability of ML algorithms. More specifically, ML techniques have been used for load balancing in Self-Organizing Networks (SON). In the context of load balancing and ML, several studies propose network management automation (NMA) from the perspective of a single and centralized agent. However, a single agent domain does not consider the interaction among the agents. In this paper, we propose a more realistic load balancing approach using novel Multi-Agent Deep Deterministic Policy Gradient with Adaptive Policies (MADDPG-AP) scheme that considers throughput, resource block utilization and latency in the network. We compare our proposal with a single-agent RL algorithm named Clipped Double Q-Learning (CDQL) . Simulation results reveal a significant improvement in latency, packet loss ratio and convergence time
翻译:近年来,由于ML算法的学习和适应能力高,为下一代无线网络使用机器学习(ML)技术取得了令人乐观的成果,更具体地说,ML技术被用于自组织网络(SON)的负负平衡;在负载平衡和ML方面,有几项研究从单一和集中的代理商的角度提出网络管理自动化(NMA)建议;然而,一个单一的代理商域不考虑代理商之间的相互作用。在本文件中,我们提议采用更现实的负载平衡方法,采用新的多力深度决定政策(MADDPG-AP)办法,其中考虑到网络的吞吐量、资源块利用和耐久性。我们将我们的提案与名为Clipt 双QL(CDQL)的单一代理商的RL算法进行比较。模拟结果显示,在惯性、包损率和趋同时间方面有很大改进。