This paper proposes a gaze correction and animation method for high-resolution, unconstrained portrait images, which can be trained without the gaze angle and the head pose annotations. Common gaze-correction methods usually require annotating training data with precise gaze, and head pose information. Solving this problem using an unsupervised method remains an open problem, especially for high-resolution face images in the wild, which are not easy to annotate with gaze and head pose labels. To address this issue, we first create two new portrait datasets: CelebGaze and high-resolution CelebHQGaze. Second, we formulate the gaze correction task as an image inpainting problem, addressed using a Gaze Correction Module (GCM) and a Gaze Animation Module (GAM). Moreover, we propose an unsupervised training strategy, i.e., Synthesis-As-Training, to learn the correlation between the eye region features and the gaze angle. As a result, we can use the learned latent space for gaze animation with semantic interpolation in this space. Moreover, to alleviate both the memory and the computational costs in the training and the inference stage, we propose a Coarse-to-Fine Module (CFM) integrated with GCM and GAM. Extensive experiments validate the effectiveness of our method for both the gaze correction and the gaze animation tasks in both low and high-resolution face datasets in the wild and demonstrate the superiority of our method with respect to the state of the arts. Code is available at https://github.com/zhangqianhui/GazeAnimationV2
翻译:本文为高清晰度、 不受限制的肖像图像提出了视觉校正和动画方法, 可以在没有凝视角度和头部显示说明的情况下进行培训。 普通的凝视校正方法通常需要用精确的凝视和头部显示信息。 使用未经监督的方法解决这个问题仍然是个开放的问题, 特别是对于高清晰度的野生图像而言, 不容易用凝视和头部显示标签来说明。 为了解决这个问题, 我们首先创建了两个新的画像数据集: CalebGaze 和 高清晰度的CelebHQGaze。 其次, 我们将凝视校正任务设计成一个映像问题, 使用Gaz校正模块(GCM) 和 Gaze Aimation 模块(GM ) 解决问题。 此外, 我们提出一个不受监督的培训策略, 即合成Astaima- As- Asreduction, 学习眼睛区域特征和凝视角度角度之间的关联。 因此, 我们可以利用所学的隐含的潜影空间内以内测的图像内插图。 此外, 我们的读和直观的校正的校正的校正方法中, 我们的校正的校正的校正的校正方法, 我们的校正的校正的校正的校正的校正的校正的校正的校正的校正的校正的校正的校正方法, 和校正的校正的校正方法, 是我们的校正的校正的校正的校正的校正的校正的校正的校正的校正的校正。