It is anticipated that the era of fully autonomous vehicle operations will be preceded by a lengthy "Transition Period" where the traffic stream will be mixed, that is, consisting of connected autonomous vehicles (CAVs), human-driven vehicles (HDVs) and connected human-driven vehicles (CHDVs). In recognition of the fact that public acceptance of CAVs will hinge on safety performance of automated driving systems, and that there will likely be safety challenges in the early part of the transition period, significant research efforts have been expended in the development of safety-conscious automated driving systems. Yet still, there appears to be a lacuna in the literature regarding the handling of the crash-imminent situations that are caused by errant human driven vehicles (HDVs) in the vicinity of the CAV during operations on the roadway. In this paper, we develop a simple model-based Reinforcement Learning (RL) based system that can be deployed in the CAV to generate trajectories that anticipate and avoid potential collisions caused by drivers of the HDVs. The model involves an end-to-end data-driven approach that contains a motion prediction model based on deep learning, and a fast trajectory planning algorithm based on model predictive control (MPC). The proposed system requires no prior knowledge or assumption about the physical environment including the vehicle dynamics, and therefore represents a general approach that can be deployed on any type of vehicle (e.g., truck, buse, motorcycle, etc.). The framework is trained and tested in the CARLA simulator with multiple collision imminent scenarios, and the results indicate the proposed model can avoid the collision at high successful rate (>85%) even in highly compact and dangerous situations.
翻译:预计,在完全自主的车辆运行时代之前,将有一个漫长的“过渡阶段”,即交通流量将混合在一起,即由连接的自动车辆(CAVs)、人驱动车辆(HDVs)和人驱动车辆(CHDVs)组成。认识到公众对CAV的接受取决于自动驾驶系统的安全性能,在过渡期初期可能存在安全挑战,在开发安全意识的机动车自动驾驶系统方面已经进行了大量研究。然而,在文献中似乎存在着关于处理由在公路上运行的错误的人驱动车辆(HDVs)和连接的人驱动车辆(CHDVs)造成的坠毁-模拟情况的空白。在本文中,我们开发了一个简单的基于模型的强化学习系统,可以在CAVAV产生模型的轨迹,预测和避免由驱动的多功能驱动的多功能系统造成的潜在碰撞。该模型涉及最终到终端的模型数据驱动方法,在轨迹上,在轨迹上进行快速预测的模型或机车动模型,因此,基于前期预测的系统需要先期的预测,在深层环境中进行。