Despite the plethora of successful Super-Resolution Reconstruction (SRR) models applied to natural images, their application to remote sensing imagery tends to produce poor results. Remote sensing imagery is often more complicated than natural images and has its peculiarities such as being of lower resolution, it contains noise, and often depicting large textured surfaces. As a result, applying non-specialized SRR models on remote sensing imagery results in artifacts and poor reconstructions. To address these problems, this paper proposes an architecture inspired by previous research work, introducing a novel approach for forcing an SRR model to output realistic remote sensing images: instead of relying on feature-space similarities as a perceptual loss, the model considers pixel-level information inferred from the normalized Digital Surface Model (nDSM) of the image. This strategy allows the application of better-informed updates during the training of the model which sources from a task (elevation map inference) that is closely related to remote sensing. Nonetheless, the nDSM auxiliary information is not required during production and thus the model infers a super-resolution image without any additional data besides its low-resolution pairs. We assess our model on two remotely sensed datasets of different spatial resolutions that also contain the DSM pairs of the images: the DFC2018 dataset and the dataset containing the national Lidar fly-by of Luxembourg. Based on visual inspection, the inferred super-resolution images exhibit particularly superior quality. In particular, the results for the high-resolution DFC2018 dataset are realistic and almost indistinguishable from the ground truth images.
翻译:尽管对自然图像应用了大量成功的超分辨率重建(SRR)模型,但在对遥感图像的应用中却往往产生不良的结果。遥感图像往往比自然图像复杂得多,而且具有较低的分辨率等特性,含有噪音,并常常描绘大量的纹理表面。因此,在遥感图像中应用非专门性SRR模型导致人工制品和重建不善。为了解决这些问题,本文件提出了一个由以往研究工作所启发的结构,引入了一种新颖的方法,迫使SRR模型输出现实的遥感图像:该模型不依赖地貌空间相似性作为感知质量损失,而是考虑从图像的普通数字表面模型(nDSM)中推断出像素级信息。因此,在培训模型期间,应用了更加知情的最新信息,该模型来源于与遥感密切相关的任务(升幅推断)中。然而,在制作期间,不需采用nDSM20型辅助信息,因此模型推导出一个超分辨率图像,而除了其低分辨率质量损失外,该模型还考虑从图像的普通地面模型中推断出像级级级信息,我们还在SDFS-SDS-SDS-S-SDSD的高级图像中特别地评估了两部数据。我们在SD-SD-SD-SD-S-SD-SD-SD-SD-SD-SD-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-SD-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-SD-SD-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-