Indoor multi-robot communications face two key challenges: one is the severe signal strength degradation caused by blockages (e.g., walls) and the other is the dynamic environment caused by robot mobility. To address these issues, we consider the reconfigurable intelligent surface (RIS) to overcome the signal blockage and assist the trajectory design among multiple robots. Meanwhile, the non-orthogonal multiple access (NOMA) is adopted to cope with the scarcity of spectrum and enhance the connectivity of robots. Considering the limited battery capacity of robots, we aim to maximize the energy efficiency by jointly optimizing the transmit power of the access point (AP), the phase shifts of the RIS, and the trajectory of robots. A novel federated deep reinforcement learning (F-DRL) approach is developed to solve this challenging problem with one dynamic long-term objective. Through each robot planning its path and downlink power, the AP only needs to determine the phase shifts of the RIS, which can significantly save the computation overhead due to the reduced training dimension. Simulation results reveal the following findings: I) the proposed F-DRL can reduce at least 86% convergence time compared to the centralized DRL; II) the designed algorithm can adapt to the increasing number of robots; III) compared to traditional OMA-based benchmarks, NOMA-enhanced schemes can achieve higher energy efficiency.
翻译:室内多机器人通信面临两大挑战:一是阻塞(如墙壁)造成的严重信号强度衰减,另一是机器人流动性造成的动态环境。为了解决这些问题,我们考虑重新配置智能表面(RIS),以克服信号阻塞,并协助多机器人的轨迹设计。与此同时,采用非垂直多存取(NOMA),以应对频谱稀缺,增强机器人的连通性。考虑到机器人电池容量有限,我们的目标是通过联合优化接入点(AP)的传输能力、RIS的阶段转变以及机器人的轨迹,最大限度地提高能效。为了解决这些问题,我们考虑重新配置智能表面(RIS),以克服信号阻塞,并协助多个机器人的轨迹设计。同时,采用非垂直多存取(NOMA)系统(NOMA)系统仅需要确定RIS的阶段变化,这可以大大节省由于培训层面的减少而导致的计算间接费用。模拟结果显示:I)拟议的F-DRL系统更高的传输能力、RIS的阶段转变以及机器人的轨迹。F-DRMA-L系统在最短的频率上可以使IMA-II系统升级到III。