The unmanned aerial vehicle (UAV)-enabled communication technology is regarded as an efficient and effective solution for some special application scenarios where existing terrestrial infrastructures are overloaded to provide reliable services. To maximize the utility of the UAV-enabled system while meeting the QoS and energy constraints, the UAV needs to plan its trajectory considering the dynamic characteristics of scenarios, which is formulated as the Markov Decision Process (MDP). To solve the above problem, a deep reinforcement learning (DRL)-based scheme is proposed here, which predicts the trend of the dynamic scenarios to provide a long-term view for the UAV trajectory planning. Simulation results validate that our proposed scheme converges more quickly and achieves the better performance in dynamic scenarios.
翻译:无人驾驶飞行器(无人驾驶飞行器)的辅助通信技术被认为是一些特殊应用情景的高效和有效解决办法,在这些情景中,现有地面基础设施超载,无法提供可靠的服务;为了在满足QOS和能源限制的同时最大限度地发挥无人驾驶飞行器辅助系统的效用,无人驾驶飞行器需要规划其轨迹,考虑到作为Markov决定程序(MDP)拟订的各种情景的动态特点;为解决上述问题,在此提议了一项基于深度强化学习(DRL)计划,该计划预测动态情景的趋势,以便为无人驾驶飞行器的轨迹规划提供长期的视角;模拟结果证实,我们拟议的计划将更快地汇合,并在动态情景中取得更好的业绩。