Self-driving cars and trucks, autonomous vehicles (AVs), should not be accepted by regulatory bodies and the public until they have much higher confidence in their safety and reliability -- which can most practically and convincingly be achieved by testing. But existing testing methods are inadequate for checking the end-to-end behaviors of AV controllers against complex, real-world corner cases involving interactions with multiple independent agents such as pedestrians and human-driven vehicles. While test-driving AVs on streets and highways fails to capture many rare events, existing simulation-based testing methods mainly focus on simple scenarios and do not scale well for complex driving situations that require sophisticated awareness of the surroundings. To address these limitations, we propose a new fuzz testing technique, called AutoFuzz, which can leverage widely-used AV simulators' API grammars to generate semantically and temporally valid complex driving scenarios (sequences of scenes). To efficiently search for traffic violations-inducing scenarios in a large search space, we propose a constrained neural network (NN) evolutionary search method to optimize AutoFuzz. Evaluation of our prototype on one state-of-the-art learning-based controller, two rule-based controllers, and one industrial-grade controller in five scenarios shows that AutoFuzz efficiently finds hundreds of traffic violations in high-fidelity simulation environments. For each scenario, AutoFuzz can find on average 10-39% more unique traffic violations than the best-performing baseline method. Further, fine-tuning the learning-based controller with the traffic violations found by AutoFuzz successfully reduced the traffic violations found in the new version of the AV controller software.
翻译:自动驾驶汽车和卡车、自主车辆等,不应被监管机构和公众所接受,除非它们对其安全和可靠性的信心大大提高 -- -- 最实际和令人信服地可以通过测试实现。但现有的测试方法不足以检查AV控制器的端到端行为,以防范与行人和人驱动车辆等多个独立代理人互动的复杂、真实世界角落案件。在街头和高速公路上测试驾驶AV未能捕捉许多罕见事件的同时,现有基于模拟的测试方法主要侧重于简单的假设情况,而对于需要精密了解周围的复杂驾驶情况,则不进行适当的规模。为了应对这些限制,我们建议采用一种新的模糊测试技术,称为AutoFuzz,它可以利用广泛使用的AVIV控制器的端到端到端行为来检查AVI控制器的复杂、实时的复杂驾驶程序。为了在大型搜索空间中高效地搜索交通违规情况,我们建议采用一个有节制的神经网络(NNEN)进化搜索方法,以优化交通流量的精确度。