3D object detection from images, one of the fundamental and challenging problems in autonomous driving, has received increasing attention from both industry and academia in recent years. Benefiting from the rapid development of deep learning technologies, image-based 3D detection has achieved remarkable progress. Particularly, more than 200 works have studied this problem from 2015 to 2021, encompassing a broad spectrum of theories, algorithms, and applications. However, to date no recent survey exists to collect and organize this knowledge. In this paper, we fill this gap in the literature and provide the first comprehensive survey of this novel and continuously growing research field, summarizing the most commonly used pipelines for image-based 3D detection and deeply analyzing each of their components. Additionally, we also propose two new taxonomies to organize the state-of-the-art methods into different categories, with the intent of providing a more systematic review of existing methods and facilitating fair comparisons with future works. In retrospect of what has been achieved so far, we also analyze the current challenges in the field and discuss future directions for image-based 3D detection research.
翻译:从图像中探测3D对象,这是自发驾驶方面一个根本性和具有挑战性的问题,近年来得到产业界和学术界越来越多的关注。从深层学习技术的迅速发展中受益,基于图像的3D探测取得了显著的进展。特别是,200多部作品研究了这一问题,从2015年至2021年,涵盖了广泛的理论、算法和应用。然而,到目前为止,还没有进行最近的调查来收集和组织这种知识。在本文件中,我们填补了文献中的这一空白,并首次全面调查了这个新颖和不断增长的研究领域,总结了最常用的基于图像的3D探测管道,并深入分析了其中每个组成部分。此外,我们还提议设立两个新的分类,将最新方法分为不同类别,目的是更系统地审查现有方法,便利与未来工作进行公平比较。回顾迄今取得的成就,我们还分析了实地目前的挑战,并讨论了基于图像的3D探测研究的未来方向。