In urban environments, the complex and uncertain intersection scenarios are challenging for autonomous driving. To ensure safety, it is crucial to develop an adaptive decision making system that can handle the interaction with other vehicles. Manually designed model-based methods are reliable in common scenarios. But in uncertain environments, they are not reliable, so learning-based methods are proposed, especially reinforcement learning (RL) methods. However, current RL methods need retraining when the scenarios change. In other words, current RL methods cannot reuse accumulated knowledge. They forget learned knowledge when new scenarios are given. To solve this problem, we propose a hierarchical framework that can autonomously accumulate and reuse knowledge. The proposed method combines the idea of motion primitives (MPs) with hierarchical reinforcement learning (HRL). It decomposes complex problems into multiple basic subtasks to reduce the difficulty. The proposed method and other baseline methods are tested in a challenging intersection scenario based on the CARLA simulator. The intersection scenario contains three different subtasks that can reflect the complexity and uncertainty of real traffic flow. After offline learning and testing, the proposed method is proved to have the best performance among all methods.
翻译:在城市环境中,复杂和不确定的交叉情景对自主驾驶具有挑战性。为确保安全,必须开发一个适应性决策系统,能够处理与其他车辆的互动。手工设计的基于模型的方法在共同情景中是可靠的。但在不确定的环境中,它们并不可靠,因此提出了基于学习的方法,特别是强化学习的方法。然而,当情景变化时,目前的RL方法需要再培训。换句话说,目前的RL方法无法再利用积累的知识。当出现新的情景时,它们忘记了所学的知识。为了解决这个问题,我们建议了一个可自主积累和再利用知识的等级框架。拟议的方法将运动原始产品的概念与等级强化学习结合起来。它将复杂的问题分解成多个基本的子任务来减少困难。拟议的方法和其他基线方法在基于 CARLA 模拟器的具有挑战性的交叉情景中测试。交叉情景包含三个不同的子任务,能够反映实际交通流量的复杂性和不确定性。在离线学习和测试后,拟议的方法被证明在所有方法中都具有最佳性。