Fifth-generation (5G) New Radio (NR) cellular networks support a wide range of new services, many of which require an application-specific quality of service (QoS), e.g. in terms of a guaranteed minimum bit-rate or a maximum tolerable delay. Therefore, scheduling multiple parallel data flows, each serving a unique application instance, is bound to become an even more challenging task compared to the previous generations. Leveraging recent advances in deep reinforcement learning, in this paper, we propose a QoS-Aware Deep Reinforcement learning Agent (QADRA) scheduler for NR networks. In contrast to state-of-the-art scheduling heuristics, the QADRA scheduler explicitly optimizes for the QoS satisfaction rate while simultaneously maximizing the network performance. Moreover, we train our algorithm end-to-end on these objectives. We evaluate QADRA in a full scale, near-product, system level NR simulator and demonstrate a significant boost in network performance. In our particular evaluation scenario, the QADRA scheduler improves network throughput by 30% while simultaneously maintaining the QoS satisfaction rate of VoIP users served by the network, compared to state-of-the-art baselines.
翻译:与前几代相比,第五代(5G)新电台(NR)蜂窝网络必然会成为更具挑战性的任务。我们在本文件中提议为NR网络配备一个QOS-Aware 深强化学习仪(QADRA)调度仪(QADRA),以全面、近产品、系统级NR模拟器的方式评估QADRA,并展示网络性能的显著提升。在我们具体的评价设想中,QADRA调度仪将网络的吞吐量提高30%,同时将网络的满意度维持在基准水平上,同时将VOS用户的满意度维持在基准水平上。