Internet of things, supported by machine-to-machine (M2M) communications, is one of the most important applications for future 6th generation (6G) systems. A major challenge facing by 6G is enabling a massive number of M2M devices to access networks in a timely manner. Therefore, this paper exploits the spatial selectivity of massive multi-input multi-output (MIMO) to reduce the collision issue when massive M2M devices initiate random access simultaneously. In particular, a beam-based random access protocol is first proposed to make efficient use of the limited uplink resources for massive M2M devices. To address the non-uniform distribution of M2M devices in the space and time dimensions, an Markov decision process (MDP) problem with the objective of minimizing the average access delay is then formulated. Next, we present a dynamic beam-based access scheme based on the double deep Q network (DDQN) algorithm to solve the optimal policy. Finally, simulations are conducted to demonstrate the effectiveness of the proposed scheme including the model training and random access performance.
翻译:由机器到机器(M2M)通信支持的物体互联网是未来第6代(6G)系统的最重要应用之一。6G面临的一个主要挑战是使大量M2M设备能够及时进入网络。因此,本文件利用了大规模多投入多输出量的空间选择,以减少大型M2M设备同时启动随机访问时的碰撞问题。特别是,首先提议了以波束为基础的随机访问协议,以便有效利用大型M2M设备的有限连接资源。为了解决空间和时间层面M2M设备的非统一分布问题,随后制定了旨在尽量减少平均访问延迟的Markov决定程序(MDP),然后我们根据双深Q网络(DDQN)算法提出了一个动态的以波束为基础的访问计划,以解决最佳政策。最后,进行了模拟,以证明拟议计划的有效性,包括示范培训和随机访问性表现。