This paper studies the problem of massive Internet of things (IoT) access in beyond fifth generation (B5G) networks using non-orthogonal multiple access (NOMA) technique. The problem involves massive IoT devices grouping and power allocation in order to respect the low latency as well as the limited operating energy of the IoT devices. The considered objective function, maximizing the number of successfully received IoT packets, is different from the classical sum-rate-related objective functions. The problem is first divided into multiple NOMA grouping subproblems. Then, using competitive analysis, an efficient online competitive algorithm (CA) is proposed to solve each subproblem. Next, to solve the power allocation problem, we propose a new reinforcement learning (RL) framework in which a RL agent learns to use the CA as a black box and combines the obtained solutions to each subproblem to determine the power allocation for each NOMA group. Our simulations results reveal that the proposed innovative RL framework outperforms deep-Q-learning methods and is close-to-optimal.
翻译:本文研究的是五代以上物质(B5G)网络使用非横向多重接入技术在第五代(B5G)之后大规模上网的问题。问题涉及大规模互联网设备分组和电力分配,以尊重低潜值以及IoT设备有限的操作能量。考虑的客观功能,最大限度地增加成功获得的IoT包的数量,不同于传统的总和率目标功能。问题首先分为多个NOMA分组子问题。然后,利用竞争分析,建议一种高效的在线竞争性算法(CA)来解决每个子问题。接下来,为解决电力分配问题,我们提议一个新的强化学习框架,即一个RL代理机构学习使用CA作为黑盒子,并将获得的每个子问题的解决方案结合起来,以确定每个NOMA组的电力分配。我们的模拟结果表明,拟议的RL框架超越了深层次学习方法,是接近于opimal的。