稀缺标签区耕地分类有效方法 (An Efficient Method for the Classification of Croplands in Scarce-Label Regions)

Two of the main challenges for cropland classification by satellite time-series images are insufficient ground-truth data and inaccessibility of high-quality hyperspectral images for under-developed areas. Unlabeled medium-resolution satellite images are abundant, but how to benefit from them is an open question. We will show how to leverage their potential for cropland classification using self-supervised tasks. Self-supervision is an approach where we provide simple training signals for the samples, which are apparent from the data's structure. Hence, they are cheap to acquire and explain a simple concept about the data. We introduce three self-supervised tasks for cropland classification. They reduce epistemic uncertainty, and the resulting model shows superior accuracy in a wide range of settings compared to SVM and Random Forest. Subsequently, we use the self-supervised tasks to perform unsupervised domain adaptation and benefit from the labeled samples in other regions. It is crucial to know what information to transfer to avoid degrading the performance. We show how to automate the information selection and transfer process in cropland classification even when the source and target areas have a very different feature distribution. We improved the model by about 24% compared to a baseline architecture without any labeled sample in the target domain. Our method is amenable to gradual improvement, works with medium-resolution satellite images, and does not require complicated models. Code and data are available.

翻译：以卫星时间序列图像对耕地进行分类的主要挑战有两大:地面数据不足,无法为开发不足的地区获取高质量的超光谱图像。无标签的中分辨率卫星图像丰富,但如何从中受益是一个未决问题。我们将展示如何利用自己监督的任务来利用其在耕地分类方面的潜力。自我监督是一种方法,我们为样本提供简单的培训信号,从数据结构中可以明显看出这一点。因此,获取和解释有关数据简单概念是便宜的。我们为耕地分类引入了三种自我监督的超光谱图像。它们减少了缩影不确定性,由此产生的模型显示,与SVM和随机森林相比,各种环境的准确性更高。随后,我们将利用自我监督的任务来进行不受监督的域调整,并从其他区域的标签样本中受益。了解哪些信息可以转让以避免降低性能。我们展示了如何将耕地分类中的信息选择和传输过程自动化,即使源和目标区域有非常复杂的来源和目标区域分类。它们减少了缩略图的不确定性,而所产生的模型显示,相对于SVM和随机森林而言,我们使用自我监督的任务来进行不受监督的调整。我们使用一种不易变式的模型,我们改进的模型可以使用。我们使用一种比较的模型,我们使用任何可变式的模型,从而改进了一种可变式的模型进行。