空中图像分类的深域适应 (Appearance Based Deep Domain Adaptation for the Classification of Aerial Images)

This paper addresses domain adaptation for the pixel-wise classification of remotely sensed data using deep neural networks (DNN) as a strategy to reduce the requirements of DNN with respect to the availability of training data. We focus on the setting in which labelled data are only available in a source domain DS, but not in a target domain DT. Our method is based on adversarial training of an appearance adaptation network (AAN) that transforms images from DS such that they look like images from DT. Together with the original label maps from DS, the transformed images are used to adapt a DNN to DT. We propose a joint training strategy of the AAN and the classifier, which constrains the AAN to transform the images such that they are correctly classified. In this way, objects of a certain class are changed such that they resemble objects of the same class in DT. To further improve the adaptation performance, we propose a new regularization loss for the discriminator network used in domain adversarial training. We also address the problem of finding the optimal values of the trained network parameters, proposing an unsupervised entropy based parameter selection criterion which compensates for the fact that there is no validation set in DT that could be monitored. As a minor contribution, we present a new weighting strategy for the cross-entropy loss, addressing the problem of imbalanced class distributions. Our method is evaluated in 42 adaptation scenarios using datasets from 7 cities, all consisting of high-resolution digital orthophotos and height data. It achieves a positive transfer in all cases, and on average it improves the performance in the target domain by 4.3% in overall accuracy. In adaptation scenarios between datasets from the ISPRS semantic labelling benchmark our method outperforms those from recent publications by 10-20% with respect to the mean intersection over union.

翻译：本文用深神经网络(DNN)作为降低 DNN对培训数据可用性的要求的战略,处理遥感数据像像素的分类的域性适应性。我们侧重于标签数据仅在源域 DS 中提供,而不是在目标域DT。我们的方法基于一个外观适应网络的对抗性培训(AAN), 将图像从 DS 转换成像 DT 图像。加上DS 的原始标签地图, 转换后的图像被用来将 DNN 与 DT 相适应。我们建议采用 AAN 和分类器的联合培训战略, 降低 DNNNN 的要求, 降低 DNNW 的可用性能要求, 限制 AAN 转换图像, 使其在源域域域域域域域域域 DT 中进行正确的分类。为了进一步尊重适应性能, 我们提议对在域对域对调试训练中使用的制网络进行新的规范性损失。我们还处理如何找到经过训练的网络参数的最佳值的问题, 提议在级级内进行不经过校正性变动的内值的变校正的变校正性变校正性值, 目标值目标值目标值标值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值值