Deep learning based methods, especially convolutional neural networks (CNNs) have been successfully applied in the field of single image super-resolution (SISR). To obtain better fidelity and visual quality, most of existing networks are of heavy design with massive computation. However, the computation resources of modern mobile devices are limited, which cannot easily support the expensive cost. To this end, this paper explores a novel frequency-aware dynamic network for dividing the input into multiple parts according to its coefficients in the discrete cosine transform (DCT) domain. In practice, the high-frequency part will be processed using expensive operations and the lower-frequency part is assigned with cheap operations to relieve the computation burden. Since pixels or image patches belong to low-frequency areas contain relatively few textural details, this dynamic network will not affect the quality of resulting super-resolution images. In addition, we embed predictors into the proposed dynamic network to end-to-end fine-tune the handcrafted frequency-aware masks. Extensive experiments conducted on benchmark SISR models and datasets show that the frequency-aware dynamic network can be employed for various SISR neural architectures to obtain the better tradeoff between visual quality and computational complexity. For instance, we can reduce the FLOPs of EDSR model by approximate $50\%$ while preserving state-of-the-art SISR performance.
翻译:深度学习方法,特别是革命神经网络(CNNs)已经成功地应用于单一图像超分辨率(SISR)领域。为了获得更忠实和视觉质量,大多数现有网络都是大量计算设计的重型设计。然而,现代移动设备的计算资源有限,无法轻易地支持昂贵的成本。为此,本文件探索了一个新型的频率感知动态网络,以便根据离子子线变换(DCT)域域的系数将输入分解成多个部分。在实践中,高频部分将使用昂贵的操作处理,低频部分将使用廉价的操作进行处理,而低频部分则配有廉价的操作来减轻计算负担。由于像素或图像补丁属于低频区域,含有相对较少的纹理细节,因此这种动态网络不会影响所产生的超分辨率图像的质量。此外,我们将预测器嵌入拟议的动态网络,以端端至端微调手制的频率感知觉面具。在基准的SISSR模型模型和数据设置上进行的广泛实验表明,在各种SISR型号复杂度测试中,我们可以使用频率感官动态网络,同时进行各种SISLIMR的精度测试。