Recent advances have shown how deep neural networks can be extremely effective at super-resolving remote sensing imagery, starting from a multitemporal collection of low-resolution images. However, existing models have neglected the issue of temporal permutation, whereby the temporal ordering of the input images does not carry any relevant information for the super-resolution task and causes such models to be inefficient with the, often scarce, ground truth data that available for training. Thus, models ought not to learn feature extractors that rely on temporal ordering. In this paper, we show how building a model that is fully invariant to temporal permutation significantly improves performance and data efficiency. Moreover, we study how to quantify the uncertainty of the super-resolved image so that the final user is informed on the local quality of the product. We show how uncertainty correlates with temporal variation in the series, and how quantifying it further improves model performance. Experiments on the Proba-V challenge dataset show significant improvements over the state of the art without the need for self-ensembling, as well as improved data efficiency, reaching the performance of the challenge winner with just 25% of the training data.
翻译:最近的进展表明,从多分辨率图像的多时收集开始,深心神经网络在超分辨率图像的解析中能够发挥极大效力,但现有模型忽视了时间变异问题,即输入图像的暂时顺序不包含任何与超分辨率任务相关的信息,并导致这种模型与可用于培训的往往稀缺的地面事实数据不起作用。因此,模型不应学习依赖时间顺序的地物提取器。在本文中,我们展示了如何建立一个完全不易于时间变异的模型,大大提高了性能和数据效率。此外,我们研究如何量化超解图像的不确定性,以便最终用户了解产品的当地质量。我们展示不确定性与系列中的时间变异如何相关,以及量化它如何进一步提高模型的性能。Proba-V挑战数据集的实验表明,在不需要自我组合以及提高数据效率的情况下,在艺术状态上取得了显著的改进,使挑战优胜者的表现只得到25%的培训数据。