联合加强深层学习和卸载:光束选择和用于毫米Wave多用户多用户MIMIMO和激光阵列的预编码 (Joint Deep Reinforcement Learning and Unfolding: Beam Selection and Precoding for mmWave Multiuser MIMO with Lens Arrays)

The millimeter wave (mmWave) multiuser multiple-input multiple-output (MU-MIMO) systems with discrete lens arrays (DLA) have received great attention due to their simple hardware implementation and excellent performance. In this work, we investigate the joint design of beam selection and digital precoding matrices for mmWave MU-MIMO systems with DLA to maximize the sum-rate subject to the transmit power constraint and the constraints of the selection matrix structure. The investigated non-convex problem with discrete variables and coupled constraints is challenging to solve and an efficient framework of joint neural network (NN) design is proposed to tackle it. Specifically, the proposed framework consists of a deep reinforcement learning (DRL)-based NN and a deep-unfolding NN, which are employed to optimize the beam selection and digital precoding matrices, respectively. As for the DRL-based NN, we formulate the beam selection problem as a Markov decision process and a double deep Q-network algorithm is developed to solve it. The base station is considered to be an agent, where the state, action, and reward function are carefully designed. Regarding the design of the digital precoding matrix, we develop an iterative weighted minimum mean-square error algorithm induced deep-unfolding NN, which unfolds this algorithm into a layerwise structure with introduced trainable parameters. Simulation results verify that this jointly trained NN remarkably outperforms the existing iterative algorithms with reduced complexity and stronger robustness.

翻译：使用离散透镜阵列的多用户多输出多输出(MM-MIMO)系统(MM-MIMO)的多用户多输出(MMU-MIMO)系统因其简单的硬件实施和出色性能而受到极大关注。在这项工作中,我们调查了MMWE MU-MIMO系统(DLA)的光束选择和数字预编码矩阵联合设计设计设计联合设计框架(NNNN),与DLA的光束选择和数字预编码矩阵联合设计。与基于DRL的电源限制和选择矩阵结构的制约一样,我们把光标选择问题设计成一个离散的变异变量和连带的参数,对于解决问题来说是困难的。具体来说,拟议框架包括一个深度强化学习(DRL) NNNW和一个深度加密矩阵,分别用来优化光标选择和数字预编码的矩阵选择。我们把光标选择问题当作一个可选择的马可选决定过程和一个双深层次的网络算法来解决这个问题。基地站被认为是一个代理,,在这个结构中,我们精心设计了一个精细的正的模型的模型的模型, 。将一个精确的模型的模型的模型的模型的模型的模型的模型的模型的模型是用来设计, 。