3D Semantic Scene Completion (SSC) can provide dense geometric and semantic scene representations, which can be applied in the field of autonomous driving and robotic systems. It is challenging to estimate the complete geometry and semantics of a scene solely from visual images, and accurate depth information is crucial for restoring 3D geometry. In this paper, we propose the first stereo SSC method named OccDepth, which fully exploits implicit depth information from stereo images (or RGBD images) to help the recovery of 3D geometric structures. The Stereo Soft Feature Assignment (Stereo-SFA) module is proposed to better fuse 3D depth-aware features by implicitly learning the correlation between stereo images. In particular, when the input are RGBD image, a virtual stereo images can be generated through original RGB image and depth map. Besides, the Occupancy Aware Depth (OAD) module is used to obtain geometry-aware 3D features by knowledge distillation using pre-trained depth models. In addition, a reformed TartanAir benchmark, named SemanticTartanAir, is provided in this paper for further testing our OccDepth method on SSC task. Compared with the state-of-the-art RGB-inferred SSC method, extensive experiments on SemanticKITTI show that our OccDepth method achieves superior performance with improving +4.82% mIoU, of which +2.49% mIoU comes from stereo images and +2.33% mIoU comes from our proposed depth-aware method. Our code and trained models are available at https://github.com/megvii-research/OccDepth.
翻译:3D Semantic Scene 补全( SSC) 可以提供密度密度高的几何和语义化的场景图示( SSC), 可以应用于自主驾驶和机器人系统领域。 仅从视觉图像中估算场景的完整几何和语义, 准确的深度信息对于恢复 3D 几何来说至关重要。 在本文中, 我们提议首个名为 Occdepteh 的SSC 立体方法, 充分利用立体图像( 或 RGBD 图像) 的隐含深度信息来帮助恢复 3D UGB2 的几何结构。 Stereoeo Soft Featy (Stereo- SFA) 模块建议通过隐含性地学习立体图像之间的关联来更好地结合 3D深度感知特征。 特别是当输入为 RGBD 图像时, 一个虚拟立体图像可以通过原始 RGB 图像和深度地图生成生成。 此外, Occwasawa 透析( ORC) 模块使用知识蒸馏中我们用于改进的 Omantistry- Steptal- State- Stabilation- Studal 方法。</s>