We introduce BuildingNet: (a) a large-scale dataset of 3D building models whose exteriors are consistently labeled, (b) a graph neural network that labels building meshes by analyzing spatial and structural relations of their geometric primitives. To create our dataset, we used crowdsourcing combined with expert guidance, resulting in 513K annotated mesh primitives, grouped into 292K semantic part components across 2K building models. The dataset covers several building categories, such as houses, churches, skyscrapers, town halls, libraries, and castles. We include a benchmark for evaluating mesh and point cloud labeling. Buildings have more challenging structural complexity compared to objects in existing benchmarks (e.g., ShapeNet, PartNet), thus, we hope that our dataset can nurture the development of algorithms that are able to cope with such large-scale geometric data for both vision and graphics tasks e.g., 3D semantic segmentation, part-based generative models, correspondences, texturing, and analysis of point cloud data acquired from real-world buildings. Finally, we show that our mesh-based graph neural network significantly improves performance over several baselines for labeling 3D meshes.
翻译:我们引入了建筑网:(a) 外部标签一致的3D建筑模型的大型数据集,(b) 通过分析其几何原始的空间和结构关系来标注用于建造网象的图形神经网络。为了创建我们的数据集,我们利用众包与专家指导相结合,产生了513K的附加说明的网状原始体,将其分为292K的语义部分,横跨2K建筑模型。数据集涵盖若干建筑类别,如房屋、教堂、摩天大楼、市政厅、图书馆和城堡。我们包括一个用于评价网状和点云标签的基准。建筑与现有基准中的对象(如ShapeNet、PartNet)相比,结构复杂性更大。因此,我们希望我们的数据集能够培育出能够应对视觉和图形任务(如3D的语义分割、部分基于组合的模式、通信、文字以及从现实世界建筑获得的点云数据的分析等)等大型的地理测量数据的算法发展。最后,我们希望我们的数据集能够大大改进我们基于模型的图像网络。