After the 2017 TuSimple Lane Detection Challenge, its evaluation based on accuracy and F1 score has become the de facto standard to measure the performance of lane detection methods. In this work, we conduct the first large-scale empirical study to evaluate the robustness of state-of-the-art lane detection methods under physical-world adversarial attacks in autonomous driving. We evaluate 4 major types of lane detection approaches with the conventional evaluation and end-to-end evaluation in autonomous driving scenarios and then discuss the security proprieties of each lane detection model. We demonstrate that the conventional evaluation fails to reflect the robustness in end-to-end autonomous driving scenarios. Our results show that the most robust model on the conventional metrics is the least robust in the end-to-end evaluation. Although the competition dataset and its metrics have played a substantial role in developing performant lane detection methods along with the rapid development of deep neural networks, the conventional evaluation is becoming obsolete and the gap between the metrics and practicality is critical. We hope that our study will help the community make further progress in building a more comprehensive framework to evaluate lane detection models.
翻译:在2017年TuSemple Lane探测挑战之后,根据准确性和F1评分进行的评价已成为衡量车道探测方法绩效的实际标准。在这项工作中,我们进行了第一次大规模的经验性研究,以评价在自然世界对立式自动驾驶攻击下最先进的车道探测方法的稳健性。我们用常规评价和自动驾驶情景的端对端评价来评价4种主要车道探测方法,然后讨论每个车道探测模型的安全性能。我们证明,常规评价未能反映端对端自动驾驶假设的稳健性。我们的结果显示,在最终至端评价中,最有力的常规计量模型是最不稳的。虽然竞争数据集及其参数在开发性能航道探测方法以及深神经网络的迅速发展方面发挥了重大作用,但常规评价已经过时,衡量标准与实用性之间的差距至关重要。我们希望我们的研究将帮助社区在建立更全面框架以评价车道探测模型方面取得进一步进展。