This paper studies the problem of polygonal mapping of buildings by tackling the issue of mask reversibility that leads to a notable performance gap between the predicted masks and polygons from the learning-based methods. We addressed such an issue by exploiting the hierarchical supervision (of bottom-level vertices, mid-level line segments and the high-level regional masks) and proposed a novel interaction mechanism of feature embedding sourced from different levels of supervision signals to obtain reversible building masks for polygonal mapping of buildings. As a result, we show that the learned reversible building masks take all the merits of the advances of deep convolutional neural networks for high-performing polygonal mapping of buildings. In the experiments, we evaluated our method on the two public benchmarks of AICrowd and Inria. On the AICrowd dataset, our proposed method obtains unanimous improvements on the metrics of AP, APboundary and PoLiS. For the Inria dataset, our proposed method also obtains very competitive results on the metrics of IoU and Accuracy. The models and source code are available at https://github.com/SarahwXU.
翻译:本文研究建筑物多边形制图问题,通过处理面罩可逆性问题,导致预测的遮罩和学习方法产生的多边形之间显著的性能差距。我们通过利用等级监督(底层脊椎、中层线段和高级别区域面具)解决了这一问题,并提出了从不同级别的监督信号中嵌入特征的新的互动机制,以获得可逆建筑物多边形制图的建筑遮罩。结果,我们表明,所学的可逆盖面罩具有高性能多边形图的深层共振动神经网络进展的所有优点。在实验中,我们评估了我们关于AICrowd和Inria两个公共基准的方法。在AICrowd数据集中,我们拟议的方法在AP、AP边界和POLiS等标准上得到了一致的改进。关于Inria数据集,我们拟议的方法还获得了关于IOU和Accrocity的参数的极具竞争力的结果。模型和源代码见https://Xwrabus/Axrabrah。