Machine-to-Machine (M2M) communication is crucial in developing Internet of Things (IoT). As it is well known that cellular networks have been considered as the primary infrastructure for M2M communications, there are several key issues to be addressed in order to deploy M2M communications over cellular networks. Notably, the rapid growth of M2M traffic dramatically increases energy consumption, as well as degrades the performance of existing Human-to-Human (H2H) traffic. Sustainable operation technology and resource management are efficacious ways for solving these issues. In this paper, we investigate a resource management problem in cellular networks with H2H/M2M coexistence. First, considering the energy-constrained nature of machine type communication devices (MTCDs), we propose a novel network model enabled by simultaneous wireless information and power transfer (SWIPT), which empowers MTCDs with the ability to simultaneously perform energy harvesting (EH) and information decoding. Given the diverse characteristics of IoT devices, we subdivide MTCDs into critical and tolerable types, further formulating the resource management problem as an energy efficiency (EE) maximization problem under divers Quality-of-Service (QoS) constraints. Then, we develop a multi-agent deep reinforcement learning (DRL) based scheme to solve this problem. It provides optimal spectrum, transmit power and power splitting (PS) ratio allocation policies, along with efficient model training under designed behaviour-tracking based state space and common reward function. Finally, we verify that with a reasonable training mechanism, multiple M2M agents successfully work cooperatively in a distributed way, resulting in network performance that outperforms other intelligence approaches in terms of convergence speed and meeting the EE and QoS requirements.
翻译:M2M(M2M)通信对于发展多功能互联网(IoT)至关重要。众所周知,蜂窝网络被认为是M2M通信的主要基础设施,因此,为了在蜂窝网络中部署M2M通信,需要解决几个关键问题。值得注意的是,M2M通信的迅速增长大大增加了能源消耗,降低了现有的人与人(H2H)通信的性能。可持续的操作技术和资源管理是解决这些问题的有效方法。在本文中,我们调查了蜂窝网络中与H2H/M2M共存的手机网络中的资源管理问题。首先,考虑到机器类型通信设备(MTCD)的能源限制性质,我们提出了一个新的网络模式,通过同时的无线信息和电传输(SWIPT),使MTCD有能力同时进行能源采集(EH)和信息解码。鉴于IOT装置的特性不同,我们将MTCD与稳定型和可调控类型相匹配,进一步将资源管理问题发展为基于能源增值的ML(EEE)升级系统,最终在基于成本的网络中,我们通过基于成本的系统化、最终问题,我们以学习的SLDRLDL(S)优化的系统,在学习的系统上,在学习一个基于以优化的系统上,将一个基于以优化的系统上,将存储的系统上,将一个基于以优化的运行的系统化问题发展一个基于以其他的方法,将它提供一种基于以自动的系统进行优化的系统,将一个基于的运行的系统,使自动的系统。