The development of accurate methods for multi-label classification (MLC) of remote sensing (RS) images is one of the most important research topics in RS. Methods based on Deep Convolutional Neural Networks (CNNs) have shown strong performance gains in RS MLC problems. However, CNN-based methods usually require a high number of reliable training images annotated by multiple land-cover class labels. Collecting such data is time-consuming and costly. To address this problem, the publicly available thematic products, which can include noisy labels, can be used to annotate RS images with zero-labeling cost. However, multi-label noise (which can be associated with wrong and missing label annotations) can distort the learning process of the MLC algorithm. The detection and correction of label noise are challenging tasks, especially in a multi-label scenario, where each image can be associated with more than one label. To address this problem, we propose a novel noise robust collaborative multi-label learning (RCML) method to alleviate the adverse effects of multi-label noise during the training phase of the CNN model. RCML identifies, ranks and excludes noisy multi-labels in RS images based on three main modules: 1) discrepancy module; 2) group lasso module; and 3) swap module. The discrepancy module ensures that the two networks learn diverse features, while producing the same predictions. The task of the group lasso module is to detect the potentially noisy labels assigned to the multi-labeled training images, while the swap module task is devoted to exchanging the ranking information between two networks. Unlike existing methods that make assumptions about the noise distribution, our proposed RCML does not make any prior assumption about the type of noise in the training set. Our code is publicly available online: http://www.noisy-labels-in-rs.org
翻译:开发遥感图像多标签分类(MLC)的准确方法是RS公司最重要的研究课题之一。基于深革命神经网络(NCNNs)的方法显示,在RS 刚果解放运动的问题中取得了很强的绩效。然而,基于CNN的方法通常需要大量可靠的培训图像,通过多个土地覆盖类标签附加说明。收集这些数据既费时又费钱。为了解决这个问题,公开提供的主题产品,包括噪音标签,可以用来用零标签成本来说明RS的图像。然而,多标签噪音(可能与错误和缺失标签说明相关联)可以扭曲MLC运算法的学习过程。检测和纠正标签噪音是具有挑战性的任务,特别是在多标签情况下,每个图像都可以与多个覆盖类类标签标签标签标签标签标签标签挂勾。为了解决这个问题,我们建议采用一种新的噪音强的多标签协作学习(RCMLML)方法,以在CNN模型的培训阶段中减少任何多标签噪音的有害影响。RCMML识别、排名和排除当前多标签分类定义定义的网络,同时在两个模块中进行升级,在前的模型中进行变换变换。