The demand for high-resolution video content has grown over the years. However, the delivery of high-resolution video is constrained either by the computational resources required for rendering or by the network bandwidth available for remote transmission. To remedy this limitation, we leverage the eye trackers found in existing augmented and virtual reality headsets. We propose applying video super-resolution (VSR) techniques to fuse low-resolution context with regional high-resolution context, enabling resource-constrained consumption of high-resolution content without a perceivable drop in quality. Eye trackers provide the gaze direction of a user, which aids the extraction of the regional high-resolution context. Since only pixels that fall within the gaze region can be resolved by the human eye, a large portion of the delivered content is redundant: we cannot perceive a difference in quality in regions beyond the observed one. To generate a visually pleasing frame from the fusion of the high-resolution and low-resolution regions, we study the capability of a deep neural network to transfer the context of the observed region to the other (low-resolution) regions of the current and future frames. We label this task Foveated Video Super-Resolution (FVSR), as the low-resolution regions of current and future frames must be super-resolved through the fusion of pixels from the gaze region. We propose Cross-Resolution Flow Propagation (CRFP) for FVSR. We train and evaluate CRFP on the REDS dataset on the task of 8x FVSR, i.e., a combination of 8x VSR and the fusion of the foveated region. Departing from the conventional per-frame quality evaluation using SSIM or PSNR, we propose evaluating past foveated regions, measuring a model's capability to leverage the noise present in eye trackers during FVSR. Code is made available at https://github.com/eugenelet/CRFP.
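To make the FVSR input concrete, the sketch below shows one plausible way to construct the two streams the abstract describes: an 8x low-resolution context frame and a full-resolution crop around the gaze point. This is a minimal illustration only, not the CRFP pipeline; the function name, the square fovea shape, and the naive strided downsampling are all assumptions for exposition.

```python
import numpy as np

def foveated_input(hr_frame, gaze_xy, fovea_radius, scale=8):
    """Illustrative FVSR input construction (hypothetical helper, not CRFP).

    Returns an 8x-downsampled context frame, a high-resolution square crop
    centered on the gaze point, and the crop's top-left offset in the
    original frame so the two can later be fused.
    """
    h, w, _ = hr_frame.shape
    # Low-resolution context: naive strided 8x downsampling (a real system
    # would use a proper anti-aliased resampler).
    lr_frame = hr_frame[::scale, ::scale]
    # High-resolution foveated region: square crop around the gaze, clipped
    # to the frame boundaries.
    x, y = gaze_xy
    x0, x1 = max(0, x - fovea_radius), min(w, x + fovea_radius)
    y0, y1 = max(0, y - fovea_radius), min(h, y + fovea_radius)
    fovea = hr_frame[y0:y1, x0:x1]
    return lr_frame, fovea, (y0, x0)

# Example: a 256x256 frame with the gaze at its center.
hr = np.random.rand(256, 256, 3)
lr, fovea, offset = foveated_input(hr, gaze_xy=(128, 128), fovea_radius=32)
```

Here `lr` is the 32x32x3 context delivered for the whole frame, while `fovea` is the 64x64x3 patch that carries full-resolution detail only where the user is looking; an FVSR model then super-resolves the rest of the frame using this patch.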