We study the problem of animating images by transferring spatio-temporal visual effects (such as melting) from a collection of videos. We tackle two primary challenges in visual effect transfer: 1) how to capture the effect we wish to distill; and 2) how to ensure that only the effect, rather than content or artistic style, is transferred from the source videos to the input image. To address the first challenge, we evaluate five loss functions; the most promising one encourages the generated animations to have similar optical flow and texture motions as the source videos. To address the second challenge, we only allow our model to move existing image pixels from the previous frame, rather than predicting unconstrained pixel values. This forces any visual effects to occur using the input image's pixels, preventing unwanted artistic style or content from the source video from appearing in the output. We evaluate our method in objective and subjective settings, and show interesting qualitative results which demonstrate objects undergoing atypical transformations, such as making a face melt or a deer bloom.
翻译:我们通过从一组视频中传输时空视觉效果(如熔化)来研究图像动画问题。 我们在视觉效果传输中应对两个主要挑战:(1) 如何捕捉我们希望蒸馏的效果;(2) 如何确保仅从源视频中将效果而不是内容或艺术风格从源视频转移到输入图像。 为了应对第一个挑战,我们评估了五个损失功能;最有希望的功能鼓励生成的动画具有与源视频类似的光学流和纹理运动。为了应对第二个挑战,我们只允许我们的模型将现有图像像素从上一个框架移走,而不是预测未受限制的像素值。这迫使任何视觉效应都使用输入图像的像素发生,防止源视频中不必要的艺术风格或内容出现在输出中。我们评估了在客观和主观环境中的方法,并展示了有趣的质量结果,显示正在经历异常变异的物体,如面熔化或脱色的膨胀。