Estimating the number of buildings in any geographical region is a vital component of urban analysis, disaster management, and public policy decision. Deep learning methods for building localization and counting in satellite imagery, can serve as a viable and cheap alternative. However, these algorithms suffer performance degradation when applied to the regions on which they have not been trained. Current large datasets mostly cover the developed regions and collecting such datasets for every region is a costly, time-consuming, and difficult endeavor. In this paper, we propose an unsupervised domain adaptation method for counting buildings where we use a labeled source domain (developed regions) and adapt the trained model on an unlabeled target domain (developing regions). We initially align distribution maps across domains by aligning the output space distribution through adversarial loss. We then exploit counting consistency constraints, within-image count consistency, and across-image count consistency, to decrease the domain shift. Within-image consistency enforces that building count in the whole image should be greater than or equal to count in any of its sub-image. Across-image consistency constraint enforces that if an image contains considerably more buildings than the other image, then their sub-images shall also have the same order. These two constraints encourage the behavior to be consistent across and within the images, regardless of the scale. To evaluate the performance of our proposed approach, we collected and annotated a large-scale dataset consisting of challenging South Asian regions having higher building densities and irregular structures as compared to existing datasets. We perform extensive experiments to verify the efficacy of our approach and report improvements of approximately 7% to 20% over the competitive baseline methods.
翻译:估计任何地理区域的建筑数量是城市分析、灾害管理和公共政策决定的重要组成部分。 建立本地化和在卫星图像中计数的深层次学习方法可以成为一个可行和廉价的替代方法。 但是,这些算法在应用到没有经过培训的区域时会发生性能退化。 目前的大型数据集主要覆盖发达区域,收集每个区域的此类数据集是一项昂贵、耗时和困难的工作。 在本文件中,我们提议一种不受监督的域内调适方法,用于计算使用标签源域(发达区域)的建筑物,以及将经过训练的模型改制成无标签目标域(发展中区域)的建筑物。我们最初通过对敌对性损失调整产出空间分布的分布图,对各不同区域进行调和调校准。 我们随后利用了一致性的数据集,内部一致性要求在整个图像中增加或等值的数值,在任何子图像中增加或等值。 横向一致性限制是,如果将图象的图象比比的图象比,则要大大地比重,则要比照整个图象的图象结构。