Existing deep learning-based video super-resolution (SR) methods usually depend on the supervised learning approach, where the training data is usually generated by the blurring operation with known or predefined kernels (e.g., Bicubic kernel) followed by a decimation operation. However, this does not hold for real applications as the degradation process is complex and cannot be approximated by these idea cases well. Moreover, obtaining high-resolution (HR) videos and the corresponding low-resolution (LR) ones in real-world scenarios is difficult. To overcome these problems, we propose a self-supervised learning method to solve the blind video SR problem, which simultaneously estimates blur kernels and HR videos from the LR videos. As directly using LR videos as supervision usually leads to trivial solutions, we develop a simple and effective method to generate auxiliary paired data from original LR videos according to the image formation of video SR, so that the networks can be better constrained by the generated paired data for both blur kernel estimation and latent HR video restoration. In addition, we introduce an optical flow estimation module to exploit the information from adjacent frames for HR video restoration. Experiments show that our method performs favorably against state-of-the-art ones on benchmarks and real-world videos.
翻译:现有基于深层次学习的视频超分辨率(SR)方法通常取决于监督的学习方法,在这种方法中,培训数据通常是由使用已知或预设的内核(例如比库比奇内核)的模糊操作生成的,然后是消亡操作。然而,由于降解过程复杂,无法很好地与这些设想案例相近,这并没有真正应用。此外,在现实世界情景中获取高分辨率(HR)视频和相应的低分辨率(LR)视频很困难。为了克服这些问题,我们提议一种自监督的学习方法来解决盲目的视频SR问题,即同时估计LR视频中的模糊内核和HR视频。正如直接使用LR视频作为监管通常导致微不足道的解决办法一样,我们开发了一种简单有效的方法,根据视频SR的图像形成,从原始的LR视频生成辅助性数据,以便网络能够更好地受生成的相配对数据的限制,用于模糊内核估和潜在人力资源视频恢复。此外,我们还引入了光流估算模块模型,以便利用从相邻的图像基准中获取信息。