There is an increasing demand for using Unmanned Aerial Vehicle (UAV), known as drones, in different applications such as packages delivery, traffic monitoring, search and rescue operations, and military combat engagements. In all of these applications, the UAV is used to navigate the environment autonomously - without human interaction, perform specific tasks and avoid obstacles. Autonomous UAV navigation is commonly accomplished using Reinforcement Learning (RL), where agents act as experts in a domain to navigate the environment while avoiding obstacles. Understanding the navigation environment and algorithmic limitations plays an essential role in choosing the appropriate RL algorithm to solve the navigation problem effectively. Consequently, this study first identifies the main UAV navigation tasks and discusses navigation frameworks and simulation software. Next, RL algorithms are classified and discussed based on the environment, algorithm characteristics, abilities, and applications in different UAV navigation problems, which will help the practitioners and researchers select the appropriate RL algorithms for their UAV navigation use cases. Moreover, identified gaps and opportunities will drive UAV navigation research.
翻译:使用无人驾驶航空飞行器(无人驾驶飞行器),即无人驾驶飞行器(无人驾驶飞行器)的需求日益增长,这种飞行器被用于各种应用,如包件交付、交通监测、搜索和救援行动以及军事战斗任务等,在所有这些应用中,无人驾驶飞行器都用于自主地在环境中航行 -- -- 没有人际互动,执行具体任务和避免障碍;自主无人驾驶飞行器导航通常使用强化学习(RL)完成,代理器在某一领域充当专家,在航行环境上航行,同时避免障碍;了解导航环境和算法限制在选择适当的遥控飞行器算法以有效解决导航问题方面发挥着至关重要的作用;因此,本研究报告首先确定了无人驾驶飞行器的主要导航任务,并讨论了导航框架和模拟软件;接着,根据环境、算法特点、能力和不同导航问题的应用,对导航算法进行了分类和讨论,这将有助于从业人员和研究人员为无人驾驶飞行器导航使用案例选择适当的遥控算法。此外,已查明的差距和机会将推动无人驾驶飞行器的导航研究。