Image animation generates a video of a source image following the motion of a driving video. State-of-the-art self-supervised image animation approaches warp the source image according to the motion of the driving video and recover the warping artifacts by inpainting. These approaches mostly use vanilla convolution for inpainting, and vanilla convolution does not distinguish between valid and invalid pixels. As a result, visual artifacts are still noticeable after inpainting. CutMix is a state-of-the-art regularization strategy that cuts and mixes patches of images and is widely studied in different computer vision tasks. Among the remaining computer vision tasks, warp-based image animation is one of the fields that the effects of CutMix have yet to be studied. This paper first presents a preliminary study on the effects of CutMix on warp-based image animation. We observed in our study that CutMix helps improve only pixel values, but disturbs the spatial relationships between pixels. Based on such observation, we propose PriorityCut, a novel augmentation approach that uses the top-k percent occluded pixels of the foreground to regularize warp-based image animation. By leveraging the domain knowledge in warp-based image animation, PriorityCut significantly reduces the warping artifacts in state-of-the-art warp-based image animation models on diverse datasets.
翻译:在驱动视频运动后, 图像动画会生成一个源图像的视频。 最先进的自我监督图像动画会根据驱动视频动画的动画对源图像进行扭曲, 并通过油漆来恢复扭曲的工艺品。 这些方法大多使用香草混在一起进行油漆, 香草混在一起不会区分有效与无效的像素。 结果, 视觉文物在涂漆后仍然可见。 切密密是一个最先进的正规化战略, 削减和混合图像, 并在不同的计算机视觉任务中广泛研究。 在其余的计算机视觉任务中, 以战争为基础的图像结构动画是尚未研究的领域之一。 本文首先介绍了关于切密混合对基于战争的像素动画动画的影响的初步研究。 我们在研究中发现, 切密像只帮助改善像素的值, 但却扰乱像素之间的空间关系。 基于这样的观察, 我们提议优先级Cut, 一种基于新式的放大法方法, 将最上百分位的动画图像模型用于战争的磁场图像的常规水平, 将战略图像降为战争的磁盘。