In recent years, Deep Reinforcement Learning emerged as a promising approach for autonomous navigation of ground vehicles and has been utilized in various areas of navigation such as cruise control, lane changing, or obstacle avoidance. However, most research works either focus on providing an end-to-end solution training the whole system using Deep Reinforcement Learning or focus on one specific aspect such as local motion planning. This however, comes along with a number of problems such as catastrophic forgetfulness, inefficient navigation behavior, and non-optimal synchronization between different entities of the navigation stack. In this paper, we propose a holistic Deep Reinforcement Learning training approach in which the training procedure is involving all entities of the navigation stack. This should enhance the synchronization between- and understanding of all entities of the navigation stack and as a result, improve navigational performance. We trained several agents with a number of different observation spaces to study the impact of different input on the navigation behavior of the agent. In profound evaluations against multiple learning-based and classic model-based navigation approaches, our proposed agent could outperform the baselines in terms of efficiency and safety attaining shorter path lengths, less roundabout paths, and less collisions.
翻译:近年来,深加学习成为地面车辆自主航行的一个很有希望的方法,在航行控制、更换航道或避免障碍等不同航行领域得到了利用,然而,大多数研究工作的重点要么是利用深加学习提供整个系统的端到端解决方案培训,要么是侧重于一个具体方面,例如地方运动规划,然而,这伴随着一系列问题,例如灾难性的遗忘、低效率的航行行为和导航堆叠不同实体之间不优化的同步。在本文中,我们建议采用整体的深加学习培训方法,培训程序涉及导航堆的所有实体。这将加强导航堆的所有实体之间的同步和理解,从而改进航行性能。我们培训了若干具有不同观测空间的代理人,研究不同投入对代理人航行行为的影响。在对多种基于学习的经典模型导航方法进行深入评估时,我们提议的代理人在效率与安全基线方面可能超越了基线,从而缩短了航道长度,减少环绕路径,减少碰撞。