With the development of deep learning, single image super-resolution (SISR) has achieved significant breakthroughs. Recently, methods to enhance the performance of SISR networks based on global feature interactions have been proposed. However, the capabilities of neurons that need to adjust their function in response to the context dynamically are neglected. To address this issue, we propose a lightweight Cross-receptive Focused Inference Network (CFIN), a hybrid network composed of a Convolutional Neural Network (CNN) and a Transformer. Specifically, a novel Cross-receptive Field Guide Transformer (CFGT) is designed to adaptively modify the network weights by using modulated convolution kernels combined with local representative semantic information. In addition, a CNN-based Cross-scale Information Aggregation Module (CIAM) is proposed to make the model better focused on potentially practical information and improve the efficiency of the Transformer stage. Extensive experiments show that our proposed CFIN is a lightweight and efficient SISR model, which can achieve a good balance between computational cost and model performance.
翻译:随着深层学习的发展,单一图像超分辨率(SISR)取得了重大突破;最近,提出了以全球特征互动为基础提高SISR网络性能的方法;然而,需要根据环境动态调整其功能的神经元的能力受到忽视;为解决这一问题,我们提议建立一个轻量跨度跨感知焦点推断网络(CFIN),这是一个由革命神经网络(CNN)和变异器组成的混合网络;具体地说,一个新的跨感知场指南变异器(CFGT)旨在利用经调整的脉动内核与当地有代表性的语义信息一起适应性地改变网络重量;此外,还提议建立一个有线电视新闻网的跨规模信息集成模块(CIAM),使模型更加注重潜在的实用信息,提高变异器阶段的效率;广泛的实验表明,我们提议的CFIN是一种轻度和高效的SISR模型,可以在计算成本和模型性能之间实现良好的平衡。