Extracting building footprints from aerial images is essential for precise urban mapping with photogrammetric computer vision technologies. Existing approaches mainly assume that the roof and footprint of a building are well overlapped, which may not hold in off-nadir aerial images as there is often a big offset between them. In this paper, we propose an offset vector learning scheme, which turns the building footprint extraction problem in off-nadir images into an instance-level joint prediction problem of the building roof and its corresponding "roof to footprint" offset vector. Thus the footprint can be estimated by translating the predicted roof mask according to the predicted offset vector. We further propose a simple but effective feature-level offset augmentation module, which can significantly refine the offset vector prediction by introducing little extra cost. Moreover, a new dataset, Buildings in Off-Nadir Aerial Images (BONAI), is created and released in this paper. It contains 268,958 building instances across 3,300 aerial images with fully annotated instance-level roof, footprint, and corresponding offset vector for each building. Experiments on the BONAI dataset demonstrate that our method achieves the state-of-the-art, outperforming other competitors by 3.37 to 7.39 points in F1-score. The codes, datasets, and trained models are available at https://github.com/jwwangchn/BONAI.git.
翻译:从空中图像中提取建筑足迹对于精确的城市制图和光度计算机视觉技术至关重要。现有办法主要假定建筑物的屋顶和足迹是完全重叠的,可能无法维持在离纳迪尔的空中图像中,因为这些图像往往有很大的抵消作用。在本文件中,我们提议了一个抵消矢量学习计划,将建筑物足迹的建筑足迹提取问题转化为建筑物屋顶及其相应的“屋顶到脚印”矢量的试想性联合预测问题。因此,足迹可以通过根据预测的抵消矢量翻译预测的屋顶遮罩来估计。我们进一步提议一个简单而有效的地平级增压模块,通过引入少量的额外费用来大大改进抵消矢量的预测。此外,本文件还创建并发布了一个新的数据集,即离纳迪尔空中图像(BONAI)中的建筑足迹提取问题,将建筑屋顶上的建筑足迹转化为3 300个空中图像,并配有完全附加说明性的屋顶、足迹和相应的抵消矢量矢量矢量。BONAI数据集的实验表明,我们的方法实现了州-州/Art-Art的模型,3。通过Fajircrecreal ex acreal decrestrate dest ams