The performance of a semantic segmentation model for remote sensing (RS) images pretrained on an annotated dataset would greatly decrease when testing on another unannotated dataset because of the domain gap. Adversarial generative methods, e.g., DualGAN, are utilized for unpaired image-to-image translation to minimize the pixel-level domain gap, which is one of the common approaches for unsupervised domain adaptation (UDA). However, existing image translation methods are facing two problems when performing RS images translation: 1) ignoring the scale discrepancy between two RS datasets which greatly affect the accuracy performance of scale-invariant objects, 2) ignoring the characteristic of real-to-real translation of RS images which brings an unstable factor for the training of the models. In this paper, ResiDualGAN is proposed for RS images translation, where a resizer module is used for addressing the scale discrepancy of RS datasets, and a residual connection is used for strengthening the stability of real-to-real images translation and improving the performance in cross-domain semantic segmentation tasks. Combining with an output space adaptation method, the proposed method greatly improves the accuracy performance on common benchmarks, which demonstrates the superiority and reliability of ResiDuanGAN. At the end of the paper, a thorough discussion is also conducted to give a reasonable explanation for the improvement of ResiDualGAN.
翻译:在附加注释的数据集上预先训练的遥感图像的语义分解模型的性能将大大降低,因为由于域差,在测试另一个未附加注释的数据集时,如果测试另一个未加注释的数据集,将大大降低遥感图像的语义分解模型的性能。 反基因方法,例如DualGAN,将被用于不受欢迎的图像到图像的翻译,以尽量减少像素级域差距,这是在不受监督的域适应方面常见的方法之一。 然而,现有的图像翻译方法在执行RS图像翻译时面临着两个问题:(1) 无视RS两个数据集之间的比例差异,这大大影响比额表变异性对象的精确性能;(2) 无视RS图像的真实到真实翻译特点,从而给模型培训带来不稳定的因素。 本文中,ResiDualGAN 提议用于RS图像翻译,用一个再生精度模块来解决RS数据集的规模差异,而剩余连接用于加强真实到真实图像翻译的稳定性,并改进跨内置的语义分解的性能性能性能的特性,G在高的精确度上,将SUAL-A的精确性调整方法转化为。