Autonomous navigation is challenging for mobile robots, especially in an unknown environment. Commonly, the robot requires multiple sensors to map the environment, locate itself, and make a plan to reach the target. However, reinforcement learning methods offer an alternative to map-free navigation tasks by learning the optimal actions to take. In this article, deep reinforcement learning agents are implemented using variants of the deep Q networks method, the D3QN and rainbow algorithms, for both the obstacle avoidance and the goal-oriented navigation task. The agents are trained and evaluated in a simulated environment. Furthermore, an analysis of the changes in the behaviour and performance of the agents caused by modifications in the reward function is conducted.
翻译:自主导航对移动机器人来说具有挑战性,特别是在未知环境中。通常,机器人需要多个传感器来绘制环境地图、定位自己并制订达到目标的计划。然而,强化学习方法通过学习最佳行动,为无地图导航任务提供了替代方法。在本条中,采用深Q网络方法、D3QN和彩虹算法等变体来实施深强化学习代理,既可以避免障碍,也可以开展面向目标的导航任务。这些代理器在模拟环境中接受培训和评价。此外,还分析了因奖励功能的改变而导致的代理器行为和性能的变化。