5G is regarded as a revolutionary mobile network, which is expected to satisfy a vast number of novel services, ranging from remote health care to smart cities. However, heterogeneous Quality of Service (QoS) requirements of different services and limited spectrum make the radio resource allocation a challenging problem in 5G. In this paper, we propose a multi-agent reinforcement learning (MARL) method for radio resource slicing in 5G. We model each slice as an intelligent agent that competes for limited radio resources, and the correlated Q-learning is applied for inter-slice resource block (RB) allocation. The proposed correlated Q-learning based interslice RB allocation (COQRA) scheme is compared with Nash Q-learning (NQL), Latency-Reliability-Throughput Q-learning (LRTQ) methods, and the priority proportional fairness (PPF) algorithm. Our simulation results show that the proposed COQRA achieves 32.4% lower latency and 6.3% higher throughput when compared with LRTQ, and 5.8% lower latency and 5.9% higher throughput than NQL. Significantly higher throughput and lower packet drop rate (PDR) is observed in comparison to PPF.
翻译:5G被视为一个革命性的流动网络,它预计将满足从远程保健到智能城市等众多新服务,从远程保健到智能城市,但不同服务和有限频谱的服务质量要求各异,使5G的无线电资源分配成为5G的棘手问题。 在本文中,我们提议对5G的无线电资源切片进行多试剂强化学习(MARL)方法。我们将每个切片作为智能剂进行模型,竞争有限的无线电资源,并将相关的Q学习用于切片间资源区块分配。拟议的基于相互学习的基于相互学习的跨链(COQRA)计划与Nash Q(NQL)学习(NQL)、Latency-Refority-Tript Q(LRTQ)方法以及优先比例公平(PPF)算法相比较。我们的模拟结果表明,拟议的COQRA比LRTQ低32.4%的拉特和6.3%的比NQL.PF的低拉特和低压率(观察到的PDR)。