A novel reconfigurable intelligent surface-aided multi-robot network is proposed, where multiple mobile robots are served by an access point (AP) through non-orthogonal multiple access (NOMA). The goal is to maximize the sum-rate of whole trajectories for multi-robot system by jointly optimizing trajectories and NOMA decoding orders of robots, phase-shift coefficients of the RIS, and the power allocation of the AP, subject to predicted initial and final positions of robots and the quality of service (QoS) of each robot. To tackle this problem, an integrated machine learning (ML) scheme is proposed, which combines long short-term memory (LSTM)-autoregressive integrated moving average (ARIMA) model and dueling double deep Q-network (D$^{3}$QN) algorithm. For initial and final position prediction for robots, the LSTM-ARIMA is able to overcome the problem of gradient vanishment of non-stationary and non-linear sequences of data. For jointly determining the phase shift matrix and robots' trajectories, D$^{3}$QN is invoked for solving the problem of action value overestimation. Based on the proposed scheme, each robot holds a global optimal trajectory based on the maximum sum-rate of a whole trajectory, which reveals that robots pursue long-term benefits for whole trajectory design. Numerical results demonstrated that: 1) LSTM-ARIMA model provides high accuracy predicting model; 2) The proposed D$^{3}$QN algorithm can achieve fast average convergence; 3) The RIS with higher resolution bits offers a bigger sum-rate of trajectories than lower resolution bits; and 4) RIS-NOMA networks have superior network performance compared to RIS-aided orthogonal counterparts.
翻译:提议建立一个新颖的可重新配置智能表面辅助多机器人网络,让多个移动机器人通过非垂直多重访问(NOMA)获得接入点(AP)服务。目标是通过联合优化机器人的轨迹和NOMA解码命令、RIS的阶段档系数和AP的功率分配,但须考虑到机器人的预测初始和最终位置以及每个机器人的服务质量(QOS)。为了解决这个问题,提议了一个集成机器学习(ML)计划,将长期短期内存(LSTM)-潜流综合平均移动(ARIMA)模型和双深Q网络(D$3QN)算法解码。对于机器人的初始和最后位置预测,LSTM-ARIMA能够克服非静止和非直线序列(QS)的渐变问题。为了共同确定长期内流模型(LIMAR3)的运行结果,每个阶段内流模型和机器人的预置的轨迹(DMAR3)预算,可以比机器人的平流流流流流流速度(QQQQ)的计算出一个快速移动模型。