Deep learning based methods have significantly boosted the study of automatic building extraction from remote sensing images. However, delineating vectorized and regular building contours like a human does remains very challenging, due to the difficulty of the methodology, the diversity of building structures, and the imperfect imaging conditions. In this paper, we propose the first end-to-end learnable building contour extraction framework, named BuildMapper, which can directly and efficiently delineate building polygons just as a human does. BuildMapper consists of two main components: 1) a contour initialization module that generates initial building contours; and 2) a contour evolution module that performs both contour vertex deformation and reduction, which removes the need for complex empirical post-processing used in existing methods. In both components, we provide new ideas, including a learnable contour initialization method to replace the empirical methods, dynamic predicted and ground truth vertex pairing for the static vertex correspondence problem, and a lightweight encoder for vertex information extraction and aggregation, which benefit a general contour-based method; and a well-designed vertex classification head for building corner vertices detection, which casts light on direct structured building contour extraction. We also built a suitable large-scale building dataset, the WHU-Mix (vector) building dataset, to benefit the study of contour-based building extraction methods. The extensive experiments conducted on the WHU-Mix (vector) dataset, the WHU dataset, and the CrowdAI dataset verified that BuildMapper can achieve a state-of-the-art performance, with a higher mask average precision (AP) and boundary AP than both segmentation-based and contour-based methods.
翻译:深层学习方法极大地推动了从遥感图像中自动提取建筑图象的研究。然而,由于方法的难度、建筑结构的多样性和不完善的成像条件,分解矢量和定期建筑等像人类那样的轮廓仍然非常具有挑战性。在本文中,我们提出了第一个端到端可学习的建筑轮廓提取框架,名为BuildMapper,它可以直接和有效地像人类那样对建筑多边形体进行划界。构建Mapper由两个主要组成部分组成:1)一个生成初始建筑轮廓的等离子初始初始化模块;2)一个等离子演化模块,既进行等离子垂直变形和缩小,从而消除现有方法中使用的复杂实验性后处理的需要。在这两个组成部分中,我们提出了新的想法,包括一种可学习的轮廓初始化方法,以取代静态的顶端通信问题为动态预测和地面真相对接,以及一个较轻量的内向信息提取和汇总的内向值解码导,这有利于基于一般轮廓的构造的内向流流流变和减小的内流演算方法;以及一个结构的内置的内置的内置的内置的内置的内置的内置数据分析方法,也可以的内置数据分析方法,以及一个结构的内置的内置的内置的内置的内置的内置的内置的内置的内置的内置的内置的内置的内置的内置的内存方法,可以使数据性数据性能的内置的内置的内置的内置的内置的内置的内置数据性能的内置的内置的内存方法,以及制的内置的内置的内置的内置的内置数据的内置的内置数据的内置数据的内置的内置的内置的内置的内置的内置的内置的内置的内置的内置的内置的内置方法,以及结构的内置的内置的内置的内置的内置的内置的内置的内置数据。