In recent years self-supervised learning has emerged as a promising candidate for unsupervised representation learning. In the visual domain its applications are mostly studied in the context of images of natural scenes. However, its applicability is especially interesting in specific areas, like remote sensing and medicine, where it is hard to obtain huge amounts of labeled data. In this work, we conduct an extensive analysis of the applicability of self-supervised learning in remote sensing image classification. We analyze the influence of the number and domain of images used for self-supervised pre-training on the performance on downstream tasks. We show that, for the downstream task of remote sensing image classification, using self-supervised pre-training on remote sensing images can give better results than using supervised pre-training on images of natural scenes. Besides, we also show that self-supervised pre-training can be easily extended to multispectral images producing even better results on our downstream tasks.
翻译:近年来,自我监督的学习已成为不受监督的演示学习的一个有希望的候选对象。在视觉领域,其应用大多在自然场景图像的范围内研究。然而,在遥感和医学等难以获得大量标签数据的特定领域,其适用性特别有趣。在这项工作中,我们对自我监督的图像学习在遥感图像分类中的适用性进行了广泛分析。我们分析了自我监督的下游任务业绩前培训中使用的图像数量和范围的影响。我们表明,在遥感图像分类的下游任务中,使用自监督的预培训比使用自然场景图像监督的预培训可以产生更好的结果。此外,我们还表明,自我监督的预培训可以很容易地扩展到多光谱图像,从而产生更好的下游任务成果。