Compression plays an important role on the efficient transmission and storage of images and videos through band-limited systems such as streaming services, virtual reality or videogames. However, compression unavoidably leads to artifacts and the loss of the original information, which may severely degrade the visual quality. For these reasons, quality enhancement of compressed images has become a popular research topic. While most state-of-the-art image restoration methods are based on convolutional neural networks, other transformers-based methods such as SwinIR, show impressive performance on these tasks. In this paper, we explore the novel Swin Transformer V2, to improve SwinIR for image super-resolution, and in particular, the compressed input scenario. Using this method we can tackle the major issues in training transformer vision models, such as training instability, resolution gaps between pre-training and fine-tuning, and hunger on data. We conduct experiments on three representative tasks: JPEG compression artifacts removal, image super-resolution (classical and lightweight), and compressed image super-resolution. Experimental results demonstrate that our method, Swin2SR, can improve the training convergence and performance of SwinIR, and is a top-5 solution at the "AIM 2022 Challenge on Super-Resolution of Compressed Image and Video".
翻译:在通过流流服务、虚拟现实或视频游戏等带宽系统高效传输和存储图像和视频方面,压缩在通过流传服务、虚拟现实或视频游戏等带宽系统高效传输和存储图像和视频方面起着重要作用。然而,压缩不可避免地导致艺术品和原始信息的丢失,从而可能严重降低视觉质量。由于这些原因,压缩图像的质量提升已成为一个受欢迎的研究课题。虽然大多数最先进的图像恢复方法都以革命性神经网络为基础,SwinIR等其他基于变压器的方法也显示了在这些任务上令人印象深刻的表现。在本文中,我们探索了新颖的Swin变压器V2,以改进图像超分辨率的SwinIR,特别是压缩输入情景。利用这一方法,我们可以解决培训变压器视觉模型中的主要问题,例如培训不稳定性、培训前和微调之间的分辨率差距以及数据饥饿。我们在三项具有代表性的任务上进行了实验:JPEGEG压缩工艺品清除、图像超分辨率(经典和轻度)和压缩图像超分辨率超分辨率。实验结果表明,我们的方法、Swin2SR、SwinIM和Swin-25图像最高级分辨率解决方案的整合和SwinIMS-S-Swin-S-S-S-S-Sy-S-S-S-S-S-S-25-S-S-S-S-S-S-VID-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-