Landsat-8 (NASA) and Sentinel-2 (ESA) are two prominent multi-spectral imaging satellite projects that provide publicly available data. The multi-spectral imaging sensors of the satellites capture images of the earth's surface in the visible and infrared region of the electromagnetic spectrum. Since the majority of the earth's surface is constantly covered with clouds, which are not transparent at these wavelengths, many images do not provide much information. To increase the temporal availability of cloud-free images of a certain area, one can combine the observations from multiple sources. However, the sensors of satellites might differ in their properties, making the images incompatible. This work provides a first glance at the possibility of using a transformer-based model to reduce the spectral and spatial differences between observations from both satellite projects. We compare the results to a model based on a fully convolutional UNet architecture. Somewhat surprisingly, we find that, while deep models outperform classical approaches, the UNet significantly outperforms the transformer in our experiments.
翻译:Landsat-8(NASA)和Sentinel-2(ESA)是两个提供公开数据的重要多光谱成像卫星项目。卫星在电磁频谱的可见和红外区域捕捉地球表面图像的多光谱成像传感器。由于地球表面大部分不断覆盖云层,这些云层在这些波长上不透明,许多图像不能提供很多信息。为了增加某一区域无云图像的暂时可用性,可以将多个来源的观测结果结合起来。然而,卫星传感器的特性可能不同,使图像不相容。这项工作提供了对使用变压器模型缩小两个卫星项目观测光谱和空间差异的可能性的第一次审视。我们把这些结果与基于完全革命 Uet 结构的模型进行比较。有些令人惊讶的是,我们发现,虽然深模型优于古典方法,但UNet在我们的实验中大大优于变压器。