A novel reconfigurable intelligent surface-aided multi-robot network is proposed, where multiple mobile robots are served by an access point (AP) through non-orthogonal multiple access (NOMA). The goal is to maximize the sum-rate of whole trajectories for the multi-robot system by jointly optimizing trajectories and NOMA decoding orders of robots, phase-shift coefficients of the RIS, and the power allocation of the AP, subject to predicted initial and final positions of robots and the quality of service (QoS) of each robot. To tackle this problem, an integrated machine learning (ML) scheme is proposed, which combines long short-term memory (LSTM)-autoregressive integrated moving average (ARIMA) model and dueling double deep Q-network (D$^{3}$QN) algorithm. For initial and final position prediction for robots, the LSTM-ARIMA is able to overcome the problem of gradient vanishment of non-stationary and non-linear sequences of data. For jointly determining the phase shift matrix and robots' trajectories, D$^{3}$QN is invoked for solving the problem of action value overestimation. Based on the proposed scheme, each robot holds an optimal trajectory based on the maximum sum-rate of a whole trajectory, which reveals that robots pursue long-term benefits for whole trajectory design. Numerical results demonstrated that: 1) LSTM-ARIMA model provides high accuracy predicting model; 2) The proposed D$^{3}$QN algorithm can achieve fast average convergence; and 3) RIS-NOMA networks have superior network performance compared to RIS-aided orthogonal counterparts.
翻译:提议建立一个新颖的智能智能表面辅助多机器人网络,其中多个移动机器人通过非垂直多重访问(NOMA)获得接入点(AP)服务。目标是通过联合优化机器人的轨迹和NOMA解码命令、RIS的级档系数和AP的动力分配,以预测机器人的初始和最终位置以及每个机器人的服务质量(QOS)为条件。为了解决这一问题,提议了一个集成机器学习(ML)计划,将多机器人系统整个轨迹的总数最大化,为此将长期短期内存(LSTM)-潜流综合移动平均(ARIMA)模型和双深Q网络(D$3}QMA解码解码,为机器人的初始和最终定位预测,LSTM-ARIMA基于提议的非静止和非线性服务质量(QS) 。为了共同确定阶段内轨轨流3的模型和机器人的运行状态,将显示每步轨迹的轨迹的轨迹的轨迹的轨迹的轨迹的轨迹。</s>