In the typical autonomous driving stack, planning and control systems represent two of the most crucial components in which data retrieved by sensors and processed by perception algorithms are used to implement a safe and comfortable self-driving behavior. In particular, the planning module predicts the path the autonomous car should follow taking the correct high-level maneuver, while control systems perform a sequence of low-level actions, controlling steering angle, throttle and brake. In this work, we propose a model-free Deep Reinforcement Learning Planner training a neural network that predicts both acceleration and steering angle, thus obtaining a single module able to drive the vehicle using the data processed by localization and perception algorithms on board of the self-driving car. In particular, the system that was fully trained in simulation is able to drive smoothly and safely in obstacle-free environments both in simulation and in a real-world urban area of the city of Parma, proving that the system features good generalization capabilities also driving in those parts outside the training scenarios. Moreover, in order to deploy the system on board of the real self-driving car and to reduce the gap between simulated and real-world performances, we also develop a module represented by a tiny neural network able to reproduce the real vehicle dynamic behavior during the training in simulation.
翻译:在典型的自主驾驶堆叠中,规划和控制系统代表了由传感器检索的数据和通过感知算法处理的数据用于实施安全和舒适的自我驾驶行为的最重要的两个最重要的组成部分。特别是,规划模块预测了自动驾驶汽车应当遵循的正确高水平操动的道路,而控制系统则执行一系列低层次的行动,控制方向角、油门和刹车。在这项工作中,我们提议了一个没有模型的深层强化学习计划系统培训一个神经网络,该网络既预测加速度,也预测方向,从而获得一个能够使用自驾驶车上的本地化和感知算法处理的数据驱动车辆的单一模块。特别是,在模拟中经过充分培训的系统能够在模拟和帕尔马市现实城市地区无障碍环境中顺利、安全地驾驶。我们证明该系统具有良好的通用能力,在培训场外的那些地方也能驱动。此外,为了将系统安装在真正的自我驾驶车上,并缩小由自我驾驶汽车进行的地方化和感知识算算算算算算算算器所处理的数据之间的鸿沟。特别是,在模拟过程中经过充分训练的系统能够在模拟和模拟的模拟中进行动态模拟的模拟的车辆的模拟中发展。