When developing autonomous driving systems (ADS), developers often need to replay previously collected driving recordings to check the correctness of newly introduced changes to the system. However, simply replaying the entire recording is not necessary given the high redundancy of driving scenes in a recording (e.g., keeping the same lane for 10 minutes on a highway). In this paper, we propose a novel test reduction and prioritization approach for multi-module ADS. First, our approach automatically encodes frames in a driving recording to feature vectors based on a driving scene schema. Then, the given recording is sliced into segments based on the similarity of consecutive vectors. Lengthy segments are truncated to reduce the length of a recording and redundant segments with the same vector are removed. The remaining segments are prioritized based on both the coverage and the rarity of driving scenes. We implemented this approach on an industry level, multi-module ADS called Apollo and evaluated it on three road maps in various regression settings. The results show that our approach significantly reduced the original recordings by over 34% while keeping comparable test effectiveness, identifying almost all injected faults. Furthermore, our test prioritization method achieves about 22% to 39% and 41% to 53% improvements over three baselines in terms of both the average percentage of faults detected (APFD) and TOP-K.
翻译:在开发自主驾驶系统(ADS)时,开发者往往需要重放先前收集的驾驶记录,以检查系统新引入的修改的正确性。然而,由于记录中的驾驶场景(例如将同一车道保留在高速公路上10分钟)的冗余程度很高,简单重放整个记录并不必要。在本文中,我们建议对多模块ADS采用新的测试减少和优先排序方法。首先,我们的方法自动编码框架,根据驾驶场景的系统图案对矢量进行定位的驱动记录。然后,根据连续矢量的相似性,将特定记录切分成部分。由于同一矢量的录音和冗余段的长度被缩短,因此不必再重复整个记录。其余部分则根据覆盖和驱动场的罕见性排列优先次序。我们在一个行业层面实施了这一方法,多模块ADS ADS阿波罗对三种路径图进行了评估。结果显示,我们的方法大大减少了34%以上的原始记录,同时保持了相似的测试效果,确定了几乎所有的输入错误。此外,我们测试比例法中的平均比例为39%至53%。