前馈式三维重建与视图合成研究进展综述 (Advances in Feed-Forward 3D Reconstruction and View Synthesis: A Survey)

Jiahui Zhang,Yuelei Li,Anpei Chen,Muyu Xu,Kunhao Liu,Jianyuan Wang,Xiao-Xiao Long,Hanxue Liang,Zexiang Xu,Hao Su,Christian Theobalt,Christian Rupprecht,Andrea Vedaldi,Kaichen Zhou,Paul Pu Liang,Shijian Lu,Fangneng Zhan

from arxiv, A project page associated with this survey is available at https://fnzhan.com/projects/Feed-Forward-3D

3D reconstruction and view synthesis are foundational problems in computer vision, graphics, and immersive technologies such as augmented reality (AR), virtual reality (VR), and digital twins. Traditional methods rely on computationally intensive iterative optimization in a complex chain, limiting their applicability in real-world scenarios. Recent advances in feed-forward approaches, driven by deep learning, have revolutionized this field by enabling fast and generalizable 3D reconstruction and view synthesis. This survey offers a comprehensive review of feed-forward techniques for 3D reconstruction and view synthesis, with a taxonomy according to the underlying representation architectures including point cloud, 3D Gaussian Splatting (3DGS), Neural Radiance Fields (NeRF), etc. We examine key tasks such as pose-free reconstruction, dynamic 3D reconstruction, and 3D-aware image and video synthesis, highlighting their applications in digital humans, SLAM, robotics, and beyond. In addition, we review commonly used datasets with detailed statistics, along with evaluation protocols for various downstream tasks. We conclude by discussing open research challenges and promising directions for future work, emphasizing the potential of feed-forward approaches to advance the state of the art in 3D vision.

翻译：三维重建与视图合成是计算机视觉、图形学以及增强现实（AR）、虚拟现实（VR）和数字孪生等沉浸式技术中的基础性问题。传统方法依赖于复杂流程中的计算密集型迭代优化，限制了其在实际场景中的应用。近年来，由深度学习驱动的前馈式方法实现了快速且可泛化的三维重建与视图合成，彻底改变了这一领域。本综述系统回顾了用于三维重建与视图合成的前馈式技术，并依据底层表示架构（包括点云、三维高斯泼溅（3DGS）、神经辐射场（NeRF）等）进行了分类梳理。我们探讨了关键任务，如无姿态重建、动态三维重建以及三维感知的图像与视频合成，并重点阐述了它们在数字人、SLAM、机器人学等领域的应用。此外，我们回顾了常用数据集及其详细统计信息，以及针对各类下游任务的评估协议。最后，我们讨论了当前面临的开放性研究挑战与未来工作的潜在方向，强调了前馈式方法在推动三维视觉技术前沿发展中的潜力。