As autonomous vehicles (AVs) take on growing Operational Design Domains (ODDs), they need to go through a systematic, transparent, and scalable evaluation process to demonstrate their benefits to society. Current scenario sampling techniques for AV performance evaluation usually focus on a specific functionality, such as lane changing, and do not accommodate a transfer of information about an AV system from one ODD to the next. In this paper, we reformulate the scenario sampling problem across ODDs and functionalities as a submodular optimization problem. To do so, we abstract AV performance as a Bayesian Hierarchical Model, which we use to infer information gained by revealing performance in new scenarios. We propose the information gain as a measure of scenario relevance and evaluation progress. Furthermore, we leverage the submodularity, or diminishing returns, property of the information gain not only to find a near-optimal scenario set, but also to propose a stopping criterion for an AV performance evaluation campaign. We find that we only need to explore about 7.5% of the scenario space to meet this criterion, a 23% improvement over Latin Hypercube Sampling.
翻译:随着自动飞行器(AVs)进入不断增长的操作设计域(ODD),它们需要经历一个系统、透明和可扩缩的评估过程,以展示其对社会的好处。当前AV性能评估的情景抽样技术通常侧重于特定的功能,例如车道变化,而不考虑将有关AV系统的信息从一个ODD系统转移到下一个系统。在本文中,我们重新将ODD和功能的情景抽样问题作为一个子模块优化问题来描述。为了做到这一点,我们将AV性能作为一种巴耶西亚高等级模型来抽象,我们用它来推断通过在新情景中显示性能而获得的信息。我们提出信息收益作为衡量情景相关性和评价进展的尺度。此外,我们利用信息亚模式或减少回报,不仅可以找到接近最佳的情景组合,而且还可以提出AV性性性能评估运动的停止标准。我们发现,我们只需要探索约7.5%的情景空间的7.5%才能达到这一标准,即对拉丁超音波采采金的23 %的改进。