Exploring Discontinuity for Video Frame Interpolation

Video frame interpolation (VFI) is the task of synthesizing an intermediate frame from two consecutive frames. Most previous studies have focused on frame warping operations and on refinement modules for the warped frames, and have been conducted on natural videos containing only continuous motion. However, many practical videos contain unnatural objects with discontinuous motion, such as logos, user interfaces, and subtitles. We propose three techniques that make existing deep learning-based VFI architectures robust to these elements. The first is a novel data augmentation strategy called figure-text mixing (FTM), which lets models learn discontinuous motion during training without any extra dataset. The second is a simple but effective module that predicts a discontinuity map (D-map), which densely distinguishes areas of continuous motion from areas of discontinuous motion. Lastly, we propose loss functions that supervise the discontinuous motion areas and can be applied together with FTM and the D-map. We additionally collect a special test benchmark called the Graphical Discontinuous Motion (GDM) dataset, consisting of mobile game and chat videos. Applied to various state-of-the-art VFI networks, our method significantly improves interpolation quality not only on videos from the GDM dataset but also on existing benchmarks containing only continuous motion, such as Vimeo90K, UCF101, and DAVIS.
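To make the idea of figure-text mixing and a discontinuity map more concrete, the following is a minimal NumPy sketch, not the authors' implementation. The abstract does not specify how FTM places its elements or how the D-map is consumed, so everything here is an assumption: the function names `figure_text_mix` and `blend_with_d_map` are hypothetical, the pasted element is assumed to stay fixed across the frame triplet, and the D-map is assumed to switch between an interpolated frame and the first input frame.

```python
import numpy as np


def figure_text_mix(frame0, frame_t, frame1, patch, top, left):
    """Paste the same patch at the same location on all three frames.

    Assumption: a static overlay (a logo or subtitle stand-in) is one simple
    way to create a region whose motion is unrelated to the background.
    Returns the three mixed frames and a binary mask of the pasted region.
    """
    h, w = patch.shape[:2]
    mixed = []
    for frame in (frame0, frame_t, frame1):
        out = frame.copy()                      # leave the inputs untouched
        out[top:top + h, left:left + w] = patch
        mixed.append(out)
    # Hypothetical ground-truth discontinuity mask: 1 inside the overlay.
    d_map = np.zeros(frame0.shape[:2], dtype=np.float32)
    d_map[top:top + h, left:left + w] = 1.0
    return mixed[0], mixed[1], mixed[2], d_map


def blend_with_d_map(interpolated, frame0, d_map):
    """Per-pixel blend (assumed): copy frame0 where d_map is 1,
    keep the interpolated frame where d_map is 0."""
    d = d_map[..., None]                        # broadcast over color channels
    return (1.0 - d) * interpolated + d * frame0


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    f0, ft, f1 = (rng.random((8, 8, 3)).astype(np.float32) for _ in range(3))
    logo = np.ones((2, 3, 3), dtype=np.float32)
    m0, mt, m1, d = figure_text_mix(f0, ft, f1, logo, top=1, left=2)
    result = blend_with_d_map(np.zeros_like(m0), m0, d)
    print(d.sum(), result.shape)
```

In this sketch the mask produced during augmentation could serve as a supervision target for a predicted D-map, which is one plausible reading of how the proposed loss functions work alongside FTM; the abstract itself does not give that detail.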

