Autonomous vehicles have the potential to lower the accident rate when compared to human driving. Moreover, it is the driving force of the automated vehicles' rapid development over the last few years. In the higher Society of Automotive Engineers (SAE) automation level, the vehicle's and passengers' safety responsibility is transferred from the driver to the automated system, so thoroughly validating such a system is essential. Recently, academia and industry have embraced scenario-based evaluation as the complementary approach to road testing, reducing the overall testing effort required. It is essential to determine the system's flaws before deploying it on public roads as there is no safety driver to guarantee the reliability of such a system. This paper proposes a Reinforcement Learning (RL) based scenario-based falsification method to search for a high-risk scenario in a pedestrian crossing traffic situation. We define a scenario as risky when a system under testing (SUT) does not satisfy the requirement. The reward function for our RL approach is based on Intel's Responsibility Sensitive Safety(RSS), Euclidean distance, and distance to a potential collision.
翻译:与人驾驶相比,自治车辆有可能降低事故率。此外,这是过去几年自动车辆快速发展的驱动力。在汽车工程师协会(SAE)自动化水平较高的地方,车辆和乘客的安全保障责任从驾驶员转移到自动化系统,因此,彻底验证这种系统至关重要。最近,学术界和工业界将情景评估作为道路测试的补充方法,减少了所需的总体测试努力。在将系统部署在公共道路上之前,必须先确定系统的缺陷,因为没有安全驱动器来保证系统可靠性。本文建议采用基于环境的强化学习(RL)假想方法,在行人穿越交通时寻找高风险情景。我们定义了一种风险情景,即测试中的系统(SUT)不能满足要求。我们的RL方法的奖励功能基于Intel的责任敏感安全(RSS)、Euclidean距离和潜在碰撞的距离。