High-resolution satellite images can provide abundant, detailed spatial information for land cover classification, which is particularly important for studying the complicated built environment. However, due to the complex land cover patterns, the costly training sample collections, and the severe distribution shifts of satellite imageries, few studies have applied high-resolution images to land cover mapping in detailed categories at large scale. To fill this gap, we present a large-scale land cover dataset, Five-Billion-Pixels. It contains more than 5 billion labeled pixels of 150 high-resolution Gaofen-2 (4 m) satellite images, annotated in a 24-category system covering artificial-constructed, agricultural, and natural classes. In addition, we propose a deep-learning-based unsupervised domain adaptation approach that can transfer classification models trained on labeled dataset (referred to as the source domain) to unlabeled data (referred to as the target domain) for large-scale land cover mapping. Specifically, we introduce an end-to-end Siamese network employing dynamic pseudo-label assignment and class balancing strategy to perform adaptive domain joint learning. To validate the generalizability of our dataset and the proposed approach across different sensors and different geographical regions, we carry out land cover mapping on five megacities in China and six cities in other five Asian countries severally using: PlanetScope (3 m), Gaofen-1 (8 m), and Sentinel-2 (10 m) satellite images. Over a total study area of 60,000 square kilometers, the experiments show promising results even though the input images are entirely unlabeled. The proposed approach, trained with the Five-Billion-Pixels dataset, enables high-quality and detailed land cover mapping across the whole country of China and some other Asian countries at meter-resolution.
翻译:高分辨率卫星图像可以为土地覆盖分类提供大量、详细的空间信息,这对于研究复杂的建筑环境尤其重要。然而,由于复杂的土地覆盖模式、昂贵的培训样本收集以及卫星图像的大规模分布变化等原因,很少有研究将高分辨率图像应用于大规模详细类别的土地覆盖绘图。为了填补这一空白,我们提出了一个大型土地覆盖数据集,即五亿亿比亚比塞。它包含有超过50亿个标记的中国高分辨率Gaofen-2(4米)卫星图像像素,在覆盖人工构造、农业和自然等级的24类图象系统中附加附加说明。此外,我们建议采用基于深层次学习的、不受监督的域适应方法,将经过标记数据集(称为源域)培训的分类模型转换为(称为目标域)大规模土地覆盖绘图的无标签数据。具体来说,我们采用动态假标签和类平衡方法,用于进行适应性域内联合学习的24类图象。此外,我们建议采用基于不同地理层次的六层图像系统,在五个不同区域进行总体测绘(10个数字传感器),并展示了亚洲不同地域域域域域域域域域域域域域中的拟议数据。