In this survey, we systematically summarize the current literature on studies that apply reinforcement learning (RL) to the motion planning and control of autonomous vehicles. Many existing contributions can be attributed to the pipeline approach, which consists of many hand-crafted modules, each with a functionality selected for the ease of human interpretation. However, this approach does not automatically guarantee maximal performance due to the lack of a system-level optimization. Therefore, this paper also presents a growing trend of work that falls into the end-to-end approach, which typically offers better performance and smaller system scales. However, their performance also suffers from the lack of expert data and generalization issues. Finally, the remaining challenges applying deep RL algorithms on autonomous driving are summarized, and future research directions are also presented to tackle these challenges.
翻译:在这次调查中,我们系统地总结了目前关于将强化学习(RL)应用于自主车辆运动规划和控制的研究文献,许多现有贡献可归因于编审方法,其中包括许多手工制作的模块,每个模块都具有便于人解释的功能,然而,由于缺乏系统一级的优化,这一方法并不自动保证最大性能;因此,本文还呈现了越来越多的工作趋势,属于端对端方法,通常提供更好的性能和较小的系统规模;然而,它们的业绩也因缺乏专家数据和一般化问题而受到影响;最后,总结了在自主驾驶方面运用深度RL算法的剩余挑战,并提出了应对这些挑战的未来研究方向。