In this paper, we develop a safe decision-making method for self-driving cars in a multi-lane, single-agent setting. The proposed approach utilizes deep reinforcement learning (RL) to achieve a high-level policy for safe tactical decision-making. We address two major challenges that arise solely in autonomous navigation. First, the proposed algorithm ensures that collisions never happen, and therefore accelerate the learning process. Second, the proposed algorithm takes into account the unobservable states in the environment. These states appear mainly due to the unpredictable behavior of other agents, such as cars, and pedestrians, and make the Markov Decision Process (MDP) problematic when dealing with autonomous navigation. Simulations from a well-known self-driving car simulator demonstrate the applicability of the proposed method
翻译:在本文中,我们为在多车道、单一试剂环境下自行驾驶的汽车制定了安全的决策方法。拟议方法利用深度强化学习(RL)实现安全战术决策的高层政策。我们应对了完全在自主导航中出现的两大挑战。首先,拟议的算法确保了碰撞永远不会发生,从而加快了学习过程。第二,拟议的算法考虑到了环境中不可观察的状态。这些国家似乎主要是因为其他代理人,如汽车和行人的行为不可预测,使Markov决策过程在处理自主导航时出现问题。一个众所周知的自我驾驶模拟木车模拟器模拟了拟议方法的适用性。