The unmanned aerial vehicle (UAV) plays an vital role in various applications such as delivery, military mission, disaster rescue, communication, etc., due to its flexibility and versatility. This paper proposes a deep reinforcement learning method to solve the UAV online routing problem with wireless power transfer, which can charge the UAV remotely without wires, thus extending the capability of the battery-limited UAV. Our study considers the power consumption of the UAV and the wireless charging process. Unlike the previous works, we solve the problem by a designed deep neural network. The model is trained using a deep reinforcement learning method offline, and is used to optimize the UAV routing problem online. On small and large scale instances, the proposed model runs from four times to 500 times faster than Google OR-tools, the state-of-the-art combinatorial optimization solver, with identical solution quality. It also outperforms different types of heuristic and local search methods in terms of both run-time and optimality. In addition, once the model is trained, it can scale to new generated problem instances with arbitrary topology that are not seen during training. The proposed method is practically applicable when the problem scale is large and the response time is crucial.
翻译:无人驾驶航空器(UAV)由于其灵活性和多功能性,在交付、军事任务、救灾、通信等各种应用中发挥着关键作用。本文件提出一个深强化学习方法,以解决无人驾驶飞行器在线线路问题,无线电源传输,无线电传输可以对无人驾驶飞行器进行远程充电,从而扩大无线无人驾驶飞行器的能力。我们的研究认为无人驾驶飞行器和无线充电过程的耗能与以往不同。我们与以往的工程不同,我们通过设计出一个深层神经网络来解决问题。模型通过离线的深度强化学习方法进行训练,并用于优化无人驾驶飞行器的在线路由问题。在小型和大型实例中,拟议模型的运行速度比Google OR-工具快四至500倍,最先进的组合优化处理器质量相同。我们的研究还考虑了无人驾驶飞行器和无线充电过程的能量消耗和无线。此外,模型一旦经过培训,就可以将生成的任意地表层问题放大,在培训期间无法看到。拟议的方法在实际反应中是关键性的。