Most 3D instance segmentation methods exploit a bottom-up strategy, typically including resource-exhaustive post-processing. For point grouping, bottom-up methods rely on prior assumptions about the objects in the form of hyperparameters, which are domain-specific and need to be carefully tuned. On the contrary, we address 3D instance segmentation with a TD3D: top-down, fully data-driven, simple approach trained in an end-to-end manner. With its straightforward fully-convolutional pipeline, it performs surprisingly well on the standard benchmarks: ScanNet v2, its extension ScanNet200, and S3DIS. Besides, our method is much faster on inference than the current state-of-the-art grouping-based approaches. Code is available at https://github.com/SamsungLabs/td3d .
翻译:大多数3D例分解方法都利用自下而上的战略,通常包括资源详尽的后处理。对于点分组,自下而上的方法依赖于以超参数形式对物体的先前假设,超参数是特定领域,需要仔细调整。相反,我们用TD3D处理3D例分解方法,即自上而下、完全由数据驱动、以端到端方式培训的简单方法。它以直截了当的全横向管道,在标准基准(ScanNet v2、扩展的ScanNet200和S3DIS)上表现得令人惊讶。此外,我们的计算方法比目前基于艺术分组的状态方法要快得多。代码可在 https://github.com/SamsungLabs/td3d上查阅。