Recently, several studies have applied deep convolutional neural networks (CNNs) in image compressive sensing (CS) tasks to improve reconstruction quality. However, convolutional layers generally have a small receptive field; therefore, capturing long-range pixel correlations using CNNs is challenging, which limits their reconstruction performance in image CS tasks. Considering this limitation, we propose a U-shaped transformer for image CS tasks, called the Uformer-ICS. We develop a projection-based transformer block by integrating the prior projection knowledge of CS into the original transformer blocks, and then build a symmetrical reconstruction model using the projection-based transformer blocks and residual convolutional blocks. Compared with previous CNN-based CS methods that can only exploit local image features, the proposed reconstruction model can simultaneously utilize the local features and long-range dependencies of an image, and the prior projection knowledge of the CS theory. Additionally, we design an adaptive sampling model that can adaptively sample image blocks based on block sparsity, which can ensure that the compressed results retain the maximum possible information of the original image under a fixed sampling ratio. The proposed Uformer-ICS is an end-to-end framework that simultaneously learns the sampling and reconstruction processes. Experimental results demonstrate that it achieves significantly better reconstruction performance than existing state-of-the-art deep learning-based CS methods.
翻译:最近,一些研究在图像压缩测量(CS)任务中应用了深层进化神经网络(CNN)来提高重建质量。然而,进化层通常有一个小的可接收场;因此,利用CNN获取长距离像素关联性具有挑战性,这限制了其在图像CS任务中的重建性能。考虑到这一局限性,我们提议为图像CS任务使用一个U形变异器,称为Unex-ICS。我们开发了一个基于预测的变压器变压器块,将CS先前的预测知识纳入原始变压器块,然后利用基于预测的变压器块和剩余相动块构建一个对称重建模型。与以前基于CNNCS的CS方法相比,只有利用当地图像特征才能捕捉到长距离的像素关联性关联性关系。考虑到这一局限性,我们建议为图像CS理论的先前预测性能设计一个基于块状震动的适应性抽样块块块,我们设计一个适应性抽样模型,这样就可以确保压缩结果保留在固定取样率比率下原始图像的最大可能的信息。与基于固定的C级取样率比例比例,而拟议的C级再研究框架则同时学习了现有试算结果,以同时展示结果,以演示结果演示结果显示目前的C。