We propose a novel DNN based framework called the Enhanced Correlation Matching based Video Frame Interpolation Network to support high resolution like 4K, which has a large scale of motion and occlusion. Considering the extensibility of the network model according to resolution, the proposed scheme employs the recurrent pyramid architecture that shares the parameters among each pyramid layer for optical flow estimation. In the proposed flow estimation, the optical flows are recursively refined by tracing the location with maximum correlation. The forward warping based correlation matching enables to improve the accuracy of flow update by excluding incorrectly warped features around the occlusion area. Based on the final bi-directional flows, the intermediate frame at arbitrary temporal position is synthesized using the warping and blending network and it is further improved by refinement network. Experiment results demonstrate that the proposed scheme outperforms the previous works at 4K video data and low-resolution benchmark datasets as well in terms of objective and subjective quality with the smallest number of model parameters.
翻译:我们提出一个新的基于 DNN 的新型框架,称为强化关联匹配基基视频框架内插网络,以支持像4K这样的高分辨率,它具有巨大的运动和隐蔽性。考虑到网络模型的可扩展性,拟议办法采用反复出现的金字塔结构,在每一个金字塔层之间共享参数以进行光学流量估计。在拟议的流量估算中,光学流通过以最大相关性跟踪位置而循环地进行精细化。基于前向扭曲基比对能够通过排除闭塞区周围错误扭曲特征来提高流量更新的准确性。根据最终双向流动,利用扭曲和混合网络对任意时间位置的中间框架进行合成,并通过改进网络加以进一步改进。实验结果显示,拟议办法比先前的4K 视频数据和低分辨率基准数据集的4K,以及客观和主观质量与最小数量的模型参数相比,比以往的工程更优。