With the development of deep representation learning, the domain of reinforcement learning (RL) has become a powerful learning framework now capable of learning complex policies in high dimensional environments. This review summarises deep reinforcement learning (DRL) algorithms and provides a taxonomy of automated driving tasks where (D)RL methods have been employed, while addressing key computational challenges in real world deployment of autonomous driving agents. It also delineates adjacent domains such as behavior cloning, imitation learning, inverse reinforcement learning that are related but are not classical RL algorithms. The role of simulators in training agents, methods to validate, test and robustify existing solutions in RL are discussed.
翻译:随着深层代表性学习的发展,强化学习领域已成为一个强大的学习框架,现在能够学习高维环境中的复杂政策,本审查总结了深度强化学习算法,并提供了在采用了(D)RL方法的情况下自动驾驶任务分类,同时解决了在现实世界部署自主驾驶器方面的主要计算挑战,还划定了相邻的领域,如行为克隆、模仿学习、反向强化学习等相关但非传统RL算法,讨论了模拟器在培训代理器中的作用、验证、测试和强化RL现有解决方案的方法。