We present an automatic method for annotating images of indoor scenes with the CAD models of the objects by relying on RGB-D scans. Through a visual evaluation by 3D experts, we show that our method retrieves annotations that are at least as accurate as manual annotations, and can thus be used as ground truth without the burden of manually annotating 3D data. We do this using an analysis-by-synthesis approach, which compares renderings of the CAD models with the captured scene. We introduce a 'cloning procedure' that identifies objects that have the same geometry, to annotate these objects with the same CAD models. This allows us to obtain complete annotations for the ScanNet dataset and the recent ARKitScenes dataset.
翻译:我们通过使用 RGB-D 扫描来用 CAD 对象模型说明室内场景图像的自动方法。 我们通过 3D 专家的视觉评估来显示, 我们的方法获取的说明至少与手动说明一样准确, 因而可以作为地面真相使用, 无需手工说明 3D 数据。 我们使用分析逐个合成的方法来这样做, 这种方法将 CAD 模型的图像与所捕场景进行比较。 我们引入了“ 调整程序 ”, 来识别具有相同几何特征的物体, 用相同的 CAD 模型来说明这些物体。 这使我们能够获取扫描网数据集和最近的 ARKitScenes 数据集的完整说明 。