While single-image super-resolution (SISR) has attracted substantial interest in recent years, the proposed approaches are limited to learning image priors in order to add high frequency details. In contrast, multi-frame super-resolution (MFSR) offers the possibility of reconstructing rich details by combining signal information from multiple shifted images. This key advantage, along with the increasing popularity of burst photography, have made MFSR an important problem for real-world applications. We propose a novel architecture for the burst super-resolution task. Our network takes multiple noisy RAW images as input, and generates a denoised, super-resolved RGB image as output. This is achieved by explicitly aligning deep embeddings of the input frames using pixel-wise optical flow. The information from all frames are then adaptively merged using an attention-based fusion module. In order to enable training and evaluation on real-world data, we additionally introduce the BurstSR dataset, consisting of smartphone bursts and high-resolution DSLR ground-truth. We perform comprehensive experimental analysis, demonstrating the effectiveness of the proposed architecture.
翻译:虽然近年来单一图像超分辨率(SISSR)引起了极大的兴趣,但拟议的方法仅限于学习图像前科,以便添加高频细节。相比之下,多框架超分辨率(MFSR)通过将多变图像的信号信息合并,提供了重建丰富细节的可能性。这一关键优势,加上爆破摄影越来越受欢迎,使MFSR成为现实世界应用的一个重要问题。我们提出了爆破超级分辨率任务的新结构。我们的网络将多个噪音的RAW图像作为输入,并生成一个被删除的、超解的 RGB 图像作为输出。这是通过使用像素光学流对输入框架的深度嵌入进行明确调整实现的。所有框架的信息随后通过基于关注的聚合模块进行适应性整合。为了能够对真实世界数据进行培训和评价,我们还介绍了BurstSR数据集,其中包括智能闪射和高分辨率DSLRLL地面图。我们进行了全面的实验分析,展示了拟议架构的有效性。