Vision-centric BEV perception has recently received increased attention from both industry and academia due to its inherent merits, including presenting a natural representation of the world and being fusion-friendly. With the rapid development of deep learning, numerous methods have been proposed to address the vision-centric BEV perception. However, there is no recent survey for this novel and growing research field. To stimulate its future research, this paper presents a comprehensive survey of recent progress of vision-centric BEV perception and its extensions. It collects and organizes the recent knowledge, and gives a systematic review and summary of commonly used algorithms. It also provides in-depth analyses and comparative results on several BEV perception tasks, facilitating the comparisons of future works and inspiring future research directions. Moreover, empirical implementation details are also discussed and shown to benefit the development of related algorithms.
翻译:最近,产业界和学术界由于其内在优点,包括自然地代表世界,并且对融合友好,对以视觉为中心的BEV概念的看法给予了越来越多的关注。随着深层学习的迅速发展,提出了许多方法来解决以视觉为中心的BEV概念的看法。然而,最近没有对这一新颖和不断增长的研究领域进行调查。为了刺激其未来的研究,本文件对以视觉为中心的BEV概念及其扩展的最近进展进行了全面调查。它收集并组织最近的知识,对常用的算法进行了系统的审查和总结。它还就一些BEV概念的任务提供了深入分析和比较结果,便利了对未来工作的比较,并激励了未来的研究方向。此外,还讨论并展示了经验实施细节,以利于相关算法的发展。