In this paper, we investigate a reconfigurable intelligent surface (RIS)-aided multiuser full-duplex secure communication system with hardware impairments at transceivers and RIS, where multiple eavesdroppers overhear the two-way transmitted signals simultaneously, and an RIS is applied to enhance the secrecy performance. Aiming at maximizing the sum secrecy rate (SSR), a joint optimization problem of the transmit beamforming at the base station (BS) and the reflecting beamforming at the RIS is formulated under the transmit power constraint of the BS and the unit modulus constraint of the phase shifters. As the environment is time-varying and the system is high-dimensional, this non-convex optimization problem is mathematically intractable. A deep reinforcement learning (DRL)-based algorithm is explored to obtain the satisfactory solution by repeatedly interacting with and learning from the dynamic environment. Extensive simulation results illustrate that the DRL-based secure beamforming algorithm is proved to be significantly effective in improving the SSR. It is also found that the performance of the DRL-based method can be greatly improved and the convergence speed of neural network can be accelerated with appropriate neural network parameters.
翻译:在本文中,我们调查了一个可重新整合的智能表面(RIS)辅助多用户全复式安全通信系统,该系统在收发器和RIS上有硬件缺陷,多电子监听器同时听到双向传输信号,并应用RIS来提高保密性能。为了最大限度地提高总保密率(SSR),在基地站(BS)的传输波束和RIS的反射波的优化联合问题,是在BS的传输力限制和阶段转换器的单元模范限制下形成的。由于环境是时间变化式的,系统是高度的,这种非电流优化问题在数学上是难以处理的。正在探索一种深加固学习(DRL)的算法,以便通过与动态环境反复互动和学习以获得令人满意的解决办法。广泛的模拟结果表明,基于DRL的安全成型算法在改进安全部门改革方面证明非常有效。还发现,基于NRL的计算方法的性能可以与适当的网络参数大大改进,并加快其速度。