In this paper, we study the resource allocation problem for an intelligent reflecting surface (IRS)-assisted OFDM system. The system sum rate maximization framework is formulated by jointly optimizing subcarrier allocation, base station transmit beamforming and IRS phase shift. Considering the continuous and discrete hybrid action space characteristics of the optimization variables, we propose an efficient resource allocation algorithm combining multiple deep Q networks (MDQN) and deep deterministic policy-gradient (DDPG) to deal with this issue. In our algorithm, MDQN are employed to solve the problem of large discrete action space, while DDPG is introduced to tackle the continuous action allocation. Compared with the traditional approaches, our proposed MDQN-DDPG based algorithm has the advantage of continuous behavior improvement through learning from the environment. Simulation results demonstrate superior performance of our design in terms of system sum rate compared with the benchmark schemes.
翻译:在本文中,我们研究了智能反射表面(IRS)辅助的OFDM系统的资源配置问题。系统总和率最大化框架是通过联合优化子载体分配、基站传送波束成形和IRS阶段转移制定的。考虑到优化变数的连续和离散混合行动空间特点,我们提出了一种高效的资源分配算法,将多个深Q网络(MDQN)和深度决定式政策梯度(DDPG)结合起来来处理这一问题。在我们的算法中,MDQN用于解决大型离散行动空间的问题,而DDPG则用于解决连续行动分配问题。与传统方法相比,我们提议的MDQN-DPG算法具有通过从环境中学习不断改进行为的优势。模拟结果表明,我们设计的系统总和率比基准计划要高。