Evaluating the reliability of intelligent physical systems against rare safety-critical events poses a huge testing burden for real-world applications. Simulation provides a useful platform to evaluate the extremal risks of these systems before their deployments. Importance Sampling (IS), while proven to be powerful for rare-event simulation, faces challenges in handling these learning-based systems due to their black-box nature that fundamentally undermines its efficiency guarantee, which can lead to under-estimation without diagnostically detected. We propose a framework called Deep Probabilistic Accelerated Evaluation (Deep-PrAE) to design statistically guaranteed IS, by converting black-box samplers that are versatile but could lack guarantees, into one with what we call a relaxed efficiency certificate that allows accurate estimation of bounds on the safety-critical event probability. We present the theory of Deep-PrAE that combines the dominating point concept with rare-event set learning via deep neural network classifiers, and demonstrate its effectiveness in numerical examples including the safety-testing of an intelligent driving algorithm.
翻译:评估智能物理系统的可靠性以防范罕见的安全临界事件给现实世界应用带来了巨大的测试负担。 模拟为评估这些系统在部署之前的极端风险提供了一个有用的平台。 重要性取样(IS)虽然被证明对稀有事件模拟很有影响力,但由于其黑盒性质从根本上破坏了其效率保障,可能导致在不诊断性检测的情况下低估其效率保障。 我们提议了一个称为深概率快速评估(深概率评估)的框架,以设计具有统计保障的IS,将具有多种功能但可能缺乏保障的黑盒取样器转换成一个我们称之为宽松的效率证书的平台,以便准确估计安全临界事件概率的界限。 我们提出了深海定位理论,将支配点概念与通过深神经网络分类器学习的稀有事件设置结合起来,并在数字实例中展示其有效性,包括智能驾驶算法的安全测试。