Towards Panoptic 3D Parsing for Single Image in the Wild

Holistic understanding and 3D reconstruction from a single image is a central task in computer vision. This paper presents an integrated system that performs holistic image segmentation, object detection, instance segmentation, depth estimation, and object instance 3D reconstruction for indoor and outdoor scenes from a single RGB image. We call the system panoptic 3D parsing: panoptic segmentation ("stuff" segmentation and "things" detection/segmentation) performed together with 3D reconstruction. For settings where a complete set of annotations is absent, we design a stage-wise system. Additionally, we present an end-to-end pipeline trained on a synthetic dataset with a full set of annotations. We show results on both indoor (3D-FRONT) and outdoor (COCO and Cityscapes) scenes. Our proposed panoptic 3D parsing framework points to a promising direction in computer vision. It can be applied in many areas, including autonomous driving, mapping, robotics, design, computer graphics, human-computer interaction, and augmented reality.


Related content

3D reconstruction


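In computer vision, 3D reconstruction is the process of recovering three-dimensional information from single-view or multi-view images. Because a single view carries incomplete information, single-view reconstruction has to draw on prior knowledge. Multi-view reconstruction (analogous to human binocular localization) is comparatively easier: the camera is first calibrated, that is, the relationship between the camera's image coordinate system and the world coordinate system is computed, and the information in multiple 2D images is then used to reconstruct the 3D information. 3D object reconstruction is a shared scientific problem and core technology in fields such as computer-aided geometric design (CAGD), computer graphics (CG), computer animation, computer vision, medical image processing, scientific computing, virtual reality, and digital media creation. There are two main ways to produce a 3D representation of an object in a computer. One uses geometric modeling software to build a 3D geometric model interactively under human control; the other acquires the geometry of a real object by some means. The former is technically mature and supported by software such as 3DMAX, Maya, AutoCAD, and UG, which generally represent geometry with curves and surfaces that have mathematical expressions. The latter is generally called 3D reconstruction: the mathematical process and computer technology of recovering an object's 3D information (such as its shape) from 2D projections, including steps such as data acquisition, preprocessing, point cloud registration, and feature analysis.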
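
The two-step multi-view procedure described above (calibrate the cameras, then combine several 2D observations) can be sketched with linear triangulation. This is a minimal illustration, not the method of the paper above: it assumes an ideal pinhole camera without lens distortion, and the intrinsics `K`, the camera poses, and the helper names `projection_matrix`, `project`, and `triangulate` are hypothetical choices made for the example.

```python
import numpy as np

def projection_matrix(K, R, t):
    """Build the 3x4 matrix P = K [R | t] that maps world points to pixels."""
    return K @ np.hstack([R, t.reshape(3, 1)])

def project(P, X):
    """Project a 3D world point X (length 3) to pixel coordinates (u, v)."""
    x = P @ np.append(X, 1.0)
    return x[:2] / x[2]

def triangulate(P1, P2, uv1, uv2):
    """Linear (DLT) triangulation of one point seen in two calibrated views."""
    # Each view contributes two linear constraints on the homogeneous point.
    A = np.stack([
        uv1[0] * P1[2] - P1[0],
        uv1[1] * P1[2] - P1[1],
        uv2[0] * P2[2] - P2[0],
        uv2[1] * P2[2] - P2[1],
    ])
    # The solution is the right singular vector with the smallest singular value.
    _, _, Vt = np.linalg.svd(A)
    X_h = Vt[-1]
    return X_h[:3] / X_h[3]

# Assumed calibration: shared intrinsics, second camera shifted 0.5 units along x.
K = np.array([[800.0, 0.0, 320.0],
              [0.0, 800.0, 240.0],
              [0.0, 0.0, 1.0]])
P1 = projection_matrix(K, np.eye(3), np.zeros(3))
P2 = projection_matrix(K, np.eye(3), np.array([-0.5, 0.0, 0.0]))

# Simulate the two 2D observations of a known point, then recover it.
X_true = np.array([0.2, -0.1, 4.0])
uv1, uv2 = project(P1, X_true), project(P2, X_true)
X_rec = triangulate(P1, P2, uv1, uv2)
print(X_rec)
```

With noise-free observations the recovered point matches the original up to floating-point error. Real pipelines typically add distortion correction and a nonlinear refinement step on top of this linear estimate.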
