This study considers multiple reconfigurable intelligent surfaces (RISs)-aided multiuser downlink systems with the goal of jointly optimizing the transmitter precoding and RIS phase shift matrix to maximize spectrum efficiency. Unlike prior work that assumed ideal RIS reflectivity, a practical coupling effect is considered between reflecting amplitude and phase shift for the RIS elements. This makes the optimization problem non-convex. To address this challenge, we propose a deep deterministic policy gradient (DDPG)-based deep reinforcement learning (DRL) framework. The proposed model is evaluated under both fixed and random numbers of users in practical mmWave channel settings. Simulation results demonstrate that, despite its complexity, the proposed DDPG approach significantly outperforms optimization-based algorithms and double deep Q-learning, particularly in scenarios with random user distributions.
翻译:本研究考虑多可重构智能表面(RIS)辅助的多用户下行链路系统,旨在联合优化发射机预编码和RIS相移矩阵以最大化频谱效率。与先前假设理想RIS反射特性的工作不同,本文考虑了RIS单元反射幅度与相移之间的实际耦合效应,这使得优化问题具有非凸性。为应对这一挑战,我们提出了一种基于深度确定性策略梯度(DDPG)的深度强化学习(DRL)框架。该模型在实际毫米波信道环境下,针对固定用户数量和随机用户数量两种场景进行了评估。仿真结果表明,尽管复杂度较高,所提出的DDPG方法显著优于基于优化的算法及双深度Q学习算法,在随机用户分布场景中表现尤为突出。