Semantic segmentation of building facade is significant in various applications, such as urban building reconstruction and damage assessment. As there is a lack of 3D point clouds datasets related to the fine-grained building facade, we construct the first large-scale building facade point clouds benchmark dataset for semantic segmentation. The existing methods of semantic segmentation cannot fully mine the local neighborhood information of point clouds. Addressing this problem, we propose a learnable attention module that learns Dual Local Attention features, called DLA in this paper. The proposed DLA module consists of two blocks, including the self-attention block and attentive pooling block, which both embed an enhanced position encoding block. The DLA module could be easily embedded into various network architectures for point cloud segmentation, naturally resulting in a new 3D semantic segmentation network with an encoder-decoder architecture, called DLA-Net in this work. Extensive experimental results on our constructed building facade dataset demonstrate that the proposed DLA-Net achieves better performance than the state-of-the-art methods for semantic segmentation.
翻译:在各种应用中,例如城市建筑重建和损坏评估中,建筑表面的静默分解很重要。由于缺少3D点云层数据集,我们建造了第一个大型建筑外观云层基准数据集,用于静态分解。现有的语义分解方法无法完全清除点云的当地周边信息。解决这个问题,我们提议了一个学习双地方注意特征的可学习关注模块,在本文中称为DLA。拟议的DLA模块由两个区块组成,包括自留区块和专心集合区块,这两个区块都嵌入了一个强化的定位编码区块。DLA模块可以很容易地嵌入用于点云分解的各种网络结构中,自然形成一个新的 3D 语系分解网络,并配有点云层分解器结构,在这项工作中称为DLA-Net。我们建造的建筑外墙数据集的广泛实验结果显示,拟议的DLA-Net的性能优于州立断层法的方法。