Learning continuous image representations is recently gaining popularity for image super-resolution (SR) because of its ability to reconstruct high-resolution images with arbitrary scales from low-resolution inputs. Existing methods mostly ensemble nearby features to predict the new pixel at any queried coordinate in the SR image. Such a local ensemble suffers from some limitations: i) it has no learnable parameters and it neglects the similarity of the visual features; ii) it has a limited receptive field and cannot ensemble relevant features in a large field which are important in an image; iii) it inherently has a gap with real camera imaging since it only depends on the coordinate. To address these issues, this paper proposes a continuous implicit attention-in-attention network, called CiaoSR. We explicitly design an implicit attention network to learn the ensemble weights for the nearby local features. Furthermore, we embed a scale-aware attention in this implicit attention network to exploit additional non-local information. Extensive experiments on benchmark datasets demonstrate CiaoSR significantly outperforms the existing single image super resolution (SISR) methods with the same backbone. In addition, the proposed method also achieves the state-of-the-art performance on the arbitrary-scale SR task. The effectiveness of the method is also demonstrated on the real-world SR setting. More importantly, CiaoSR can be flexibly integrated into any backbone to improve the SR performance.
翻译:最近,学习连续图像表征越来越受到图像超分辨率(SR)的欢迎,因为它有能力重建高分辨率图像,从低分辨率输入的任意比例上任意调整图像; 现有方法主要是将附近各种特征合在一起,在SR图像中任何询问的坐标处预测新的像素。 这种本地组合受到一些限制:(一) 它没有可学习的参数,忽视了视觉特征的相似性;(二) 它的可接受域有限,无法在对图像很重要的大领域将相关特性组合在一起; 三) 它与真正的摄影成像有内在差距,因为它仅取决于协调; 为了解决这些问题,本文件建议持续隐含的注意网络,称为CiaoSR。 我们明确设计一个隐含的注意网络,以学习附近本地特征的共性重量; 此外,我们在这个隐含的注意网络中嵌入一个能度的注意范围,以利用更多的非本地信息。 基准数据集的广泛实验显示,CiaoSR与现有的单一图像超分辨率(SISR)存在差距,因为它只取决于协调。为了解决这些问题,本文件建议的一个持续隐含的注意网络网络,我们明确设计一个隐含的注意网络。 此外,拟议的S-SR的进度上的任何方法也可以改进。