Video frame synthesis, which consists of interpolation and extrapolation, is an essential video processing technique that can be applied to various scenarios. However, most existing methods cannot handle small objects or large motion well, especially in high-resolution videos such as 4K videos. To eliminate such limitations, we introduce a neighbor correspondence matching (NCM) algorithm for flow-based frame synthesis. Since the current frame is not available in video frame synthesis, NCM is performed in a current-frame-agnostic fashion to establish multi-scale correspondences in the spatial-temporal neighborhoods of each pixel. Based on the powerful motion representation capability of NCM, we further propose to estimate intermediate flows for frame synthesis in a heterogeneous coarse-to-fine scheme. Specifically, the coarse-scale module is designed to leverage neighbor correspondences to capture large motion, while the fine-scale module is more computationally efficient to speed up the estimation process. Both modules are trained progressively to eliminate the resolution gap between training dataset and real-world videos. Experimental results show that NCM achieves state-of-the-art performance on several benchmarks. In addition, NCM can be applied to various practical scenarios such as video compression to achieve better performance.
翻译:由内推和外推法构成的视频框架合成是一种基本的视频处理技术,可以应用于各种情景。然而,大多数现有方法都无法很好地处理小物体或大动作,特别是在4K视频等高分辨率视频中。为了消除这些限制,我们引入了用于流动框架合成的邻居通信匹配算法。由于视频框架合成中没有目前的框架,因此,NCM是以当前框架的不可知方式进行,以便在每个像素的空间时空区建立多尺度通信。根据NCM的强大运动代表能力,我们进一步建议估计中间流量,以便在一个千差万别的粗略到平底计划中进行框架合成。具体地说,粗略的模块旨在利用邻居通信获取大动作,而微规模模块则更具有计算效率,以加快估算过程。两个模块都经过了培训,以逐步消除培训数据集与现实世界视频之间的分辨率差距。实验结果显示,NCMM在几个基准基准上实现了最先进的业绩。此外,NCM可以将各种实际性业绩应用到各种图像中。