Recent advances in modeling 3D objects mostly rely on synthetic datasets due to the lack of large-scale realscanned 3D databases. To facilitate the development of 3D perception, reconstruction, and generation in the real world, we propose OmniObject3D, a large vocabulary 3D object dataset with massive high-quality real-scanned 3D objects. OmniObject3D has several appealing properties: 1) Large Vocabulary: It comprises 6,000 scanned objects in 190 daily categories, sharing common classes with popular 2D datasets (e.g., ImageNet and LVIS), benefiting the pursuit of generalizable 3D representations. 2) Rich Annotations: Each 3D object is captured with both 2D and 3D sensors, providing textured meshes, point clouds, multiview rendered images, and multiple real-captured videos. 3) Realistic Scans: The professional scanners support highquality object scans with precise shapes and realistic appearances. With the vast exploration space offered by OmniObject3D, we carefully set up four evaluation tracks: a) robust 3D perception, b) novel-view synthesis, c) neural surface reconstruction, and d) 3D object generation. Extensive studies are performed on these four benchmarks, revealing new observations, challenges, and opportunities for future research in realistic 3D vision.
翻译:建模 3D 对象的最近进展主要依赖合成数据集,因为缺少大规模真实的 3D 数据库。为了便利3D 概念的开发、重建和在现实世界中生成,我们提议OmniObject3D, 一个大型词汇 3D 对象数据集,包含大量高质量真实扫描的 3D 对象。OmniObject3D 有几种吸引人的特性:(1) 大型词汇:它由190个每日类别中的6 000个扫描对象组成,与受欢迎的 2D 数据集(如图像网和LVIS)共享共同的类别,有利于通用的 3D 表示的追求。(2) 丰富的说明:每个3D 对象都以2D 和 3D 传感器同时捕获,提供文本的模具、点云、多视图的图像和多个真实的视频。(3) 现实扫描:专业扫描仪支持具有精确形状和真实外观的高质量对象扫描。随着OmniOject3D 提供的广阔探索空间,我们仔细地设置了四个评估轨道:强的3D 视野,这些视野的地面研究、 以及未来研究的展望3D 。