Super-resolution is a fundamental problem in computer vision which aims to overcome the spatial limitation of camera sensors. While significant progress has been made in single image super-resolution, most algorithms only perform well on synthetic data, which limits their applications in real scenarios. In this paper, we study the problem of real-scene single image super-resolution to bridge the gap between synthetic data and real captured images. We focus on two issues of existing super-resolution algorithms: lack of realistic training data and insufficient utilization of visual information obtained from cameras. To address the first issue, we propose a method to generate more realistic training data by mimicking the imaging process of digital cameras. For the second issue, we develop a two-branch convolutional neural network to exploit the radiance information originally-recorded in raw images. In addition, we propose a dense channel-attention block for better image restoration as well as a learning-based guided filter network for effective color correction. Our model is able to generalize to different cameras without deliberately training on images from specific camera types. Extensive experiments demonstrate that the proposed algorithm can recover fine details and clear structures, and achieve high-quality results for single image super-resolution in real scenes.
翻译:超级分辨率是计算机视觉中一个根本问题,它的目的是克服相机传感器的空间限制。虽然在单一图像超分辨率方面取得了显著进展,但大多数算法只对合成数据产生了良好的效果,这限制了合成数据的应用。在本文中,我们研究了真实摄像单一图像超分辨率的问题,以弥合合成数据和真实摄像之间的鸿沟。我们侧重于现有的超级分辨率算法的两个问题:缺乏现实的培训数据和对从相机获得的视觉信息的利用不足。为了解决第一个问题,我们提出了一个方法,通过模拟数码相机的成像过程来生成更现实的培训数据。关于第二个问题,我们开发了一个两管的共振动神经网络,以利用最初在原始图像中记录的亮光信息。此外,我们提出了一个密集的频道关注屏障,以更好地恢复图像,以及一个基于学习的、有指导的过滤网络,以进行有效的色彩校正。我们的模型能够在不对特定相机类型的图像进行有意培训的情况下向不同的照相机进行普及。广泛的实验表明,拟议的算法可以恢复精确的细节和清晰的结构,并在真实的图像中取得高质量的结果。