Scenario-based testing is a promising approach to solve the challenge of proving the safe behavior of vehicles equipped with automated driving systems (ADS). Since an infinite number of concrete scenarios can theoretically occur in real-world road traffic, the extraction of relevant scenarios that are sensitive regarding the safety-related behavior of ADS-equipped vehicles is a key aspect for the successful verification and validation of these systems. Therefore, this paper provides a method for extracting multimodal urban traffic scenarios from naturalistic road traffic data in an unsupervised manner for minimizing the amount of (potentially biased) prior expert knowledge needed. Rather than an (expensive) rule-based assignment by extracting concrete scenarios into predefined functional scenarios, the presented method deploys an unsupervised machine learning pipeline. It includes principal feature analysis, feature extraction with so-called scenario grids, dimensionality reduction by principal component analysis, scenario clustering as well as cluster validation. The approach allows exploring the unknown natures of the data and interpreting them as scenarios that experts could not have anticipated. The method is demonstrated and evaluated for naturalistic road traffic data at urban intersections from the inD and the Silicon Valley dataset. The findings encourage the use of this type of data as well as unsupervised machine learning approaches as important pillar for a systematic construction of a relevant scenario database with sufficient coverage for testing ADS.
翻译:以假设为基础的测试是解决证明配备自动化驾驶系统(ADS)的车辆安全行为挑战的一个很有希望的方法。由于理论上在现实世界道路交通中可以出现大量具体情景,因此,提取对ADS装备的车辆与安全有关的行为的敏感相关情景是成功核查和验证这些系统的一个关键方面。因此,本文件提供了一种方法,从自然公路交通数据中提取多式城市交通假设情景,以不受监督的方式最大限度地减少专家事先需要的(潜在偏差)专家知识的数量。而不是通过将具体情景带入预先确定的功能情景进行(昂贵的)基于规则的任务,因此,所提出的方法采用了一种不受监督的机器学习管道。它包括主要特征分析,用所谓的情景网格提取特征,按主要部件分析、情景组合组合以及集群验证方式减少维度。这一方法有助于探索这些数据的未知性质,并将这些数据解释为专家无法预料到的情景。该方法通过将城市交叉点的自然性道路交通数据数据演示和评价,而Silicon河谷系统数据覆盖方式则部署一种不受监督的机器,鼓励将这一数据模型用于相关的系统测试。