Deep neural networks (DNN) which are employed in perception systems for autonomous driving require a huge amount of data to train on, as they must reliably achieve high performance in all kinds of situations. However, these DNN are usually restricted to a closed set of semantic classes available in their training data, and are therefore unreliable when confronted with previously unseen instances. Thus, multiple perception datasets have been created for the evaluation of anomaly detection methods, which can be categorized into three groups: real anomalies in real-world, synthetic anomalies augmented into real-world and completely synthetic scenes. This survey provides a structured and, to the best of our knowledge, complete overview and comparison of perception datasets for anomaly detection in autonomous driving. Each chapter provides information about tasks and ground truth, context information, and licenses. Additionally, we discuss current weaknesses and gaps in existing datasets to underline the importance of developing further data.
翻译:在自主驾驶的感知系统中使用的深神经网络(DNN)需要大量数据来进行训练,因为它们必须在各种情况下可靠地取得高性能;然而,这些DNN通常限于其培训数据中可提供的一套封闭的语义类,因此在遇到以前不为人知的情况时不可靠;因此,为评估异常现象探测方法建立了多种感知数据集,可分为三类:真实世界中真正的异常现象、合成异常现象扩大为真实世界和完全合成的场景;这项调查根据我们的知识,为自动驾驶中异常现象探测的感知数据集提供了结构化的完整概览和比较;每一章都提供了关于任务和地面真相、背景信息和许可证的信息;此外,我们讨论了现有数据组中目前的弱点和差距,以强调进一步发展数据的重要性。