As a main use case of 5G and Beyond wireless network, the ever-increasing machine type communications (MTC) devices pose critical challenges over MTC network in recent years. It is imperative to support massive MTC devices with limited resources. To this end, Non-orthogonal multiple access (NOMA) based random access network has been deemed as a prospective candidate for MTC network. In this paper, we propose a deep reinforcement learning (RL) based approach for NOMA-based random access network with truncated channel inversion power control. Specifically, each MTC device randomly selects a pre-defined power level with a certain probability for data transmission. Devices are using channel inversion power control yet subject to the upper bound of the transmission power. Due to the stochastic feature of the channel fading and the limited transmission power, devices with different achievable power levels have been categorized as different types of devices. In order to achieve high throughput with considering the fairness between all devices, two objective functions are formulated. One is to maximize the minimum long-term expected throughput of all MTC devices, the other is to maximize the geometric mean of the long-term expected throughput for all MTC devices. A Policy based deep reinforcement learning approach is further applied to tune the transmission probabilities of each device to solve the formulated optimization problems. Extensive simulations are conducted to show the merits of our proposed approach.
翻译:作为5G和5G无线网络的主要使用案例,不断增长的机器类型通信设备近年来对MTC网络构成重大挑战,必须支持资源有限的大型MTC设备。为此,基于非横向多重访问(NOMA)随机访问网络被视为MTC网络的潜在候选对象。在本文件中,我们建议对基于NOMA的随机访问网络采用基于深度强化学习(RL)的方法,并配有短道通道反向电源控制。具体地说,每个MTC设备随机地选择了预先确定的权力水平,并有一定的数据传输概率。设备正在使用频道反向电源控制,但受传输能力上限的限制。由于频道的模糊性和有限的传输能力,因此将具有不同可实现能力水平的装置归类为不同类型的装置。为了在考虑所有装置之间的公平性的情况下实现高传输率,我们制定了两个目标功能。为了最大限度地实现所有MTC设备的最低长期预期值,其他设备正在使用频道反向动力控制,但受传输能力控制,但受传输能力上限的限制。由于频道的模糊性和有限的传输能力,因此,因此,对以不同可实现不同功能的拟议电流流分析方法进行最大程度的升级。