While recent advances in video reenactment research have yielded promising results, existing approaches fall short in capturing the fine, detailed, and expressive facial features (e.g., lip-pressing, mouth puckering, mouth gaping, and wrinkles) that are crucial for generating realistic animated face videos. To this end, we propose an end-to-end expressive face video encoding approach that facilitates data-efficient, high-quality video re-synthesis by optimizing low-dimensional edits of a single Identity-latent. The approach builds on StyleGAN2 image inversion and multi-stage non-linear latent-space editing to generate videos that are nearly comparable to the input videos. While existing StyleGAN latent-based editing techniques focus on generating plausible edits of static images, we automate the latent-space editing to capture the fine expressive facial deformations across a sequence of frames, using an encoding that resides in the Style-latent-space (StyleSpace) of StyleGAN2. The encoding thus obtained can be superimposed on a single Identity-latent to facilitate re-enactment of face videos at $1024^2$ resolution. The proposed framework economically captures face identity, head pose, and complex expressive facial motions at a fine level, thereby bypassing the training, person-specific modeling, dependence on landmarks/keypoints, and low-resolution synthesis that hamper most re-enactment approaches. The approach is designed for maximal data efficiency: a single $W+$ latent and 35 parameters per frame enable high-fidelity video rendering. The pipeline can also be used for puppeteering (i.e., motion transfer).
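The core encoding idea — one shared identity latent plus a small per-frame parameter vector — can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the latent shapes follow the standard StyleGAN2 configuration at $1024^2$ (18 $W+$ layers of 512 dimensions), the choice of 35 edited channels is taken from the abstract, and the specific (layer, channel) pairs here are randomly chosen placeholders. The sketch also applies the 35 scalars directly to $W+$ channels as an approximation, whereas the paper's encoding lives in StyleSpace (the per-layer style codes after the affine transforms).

```python
import numpy as np

# Assumed dimensions: StyleGAN2 at 1024^2 uses 18 W+ layers of 512 dims.
N_LAYERS, DIM = 18, 512
N_PARAMS = 35  # per-frame edit parameters, as stated in the abstract

rng = np.random.default_rng(0)

# A single identity latent shared across the whole video (W+ space).
w_identity = rng.standard_normal((N_LAYERS, DIM))

# Hypothetical selection of 35 channels to edit; in the paper these
# would be specific StyleSpace channels, not random W+ coordinates.
channels = list(zip(rng.integers(0, N_LAYERS, N_PARAMS),
                    rng.integers(0, DIM, N_PARAMS)))

def apply_frame_edit(w_id, params):
    """Superimpose the 35 per-frame scalars on the identity latent."""
    w = w_id.copy()
    for (layer, ch), p in zip(channels, params):
        w[layer, ch] += p
    return w

# A video then reduces to the identity latent plus one 35-vector per
# frame; each edited latent would be fed to the StyleGAN2 synthesis net.
frame_params = rng.standard_normal((4, N_PARAMS))  # 4 example frames
frames = [apply_frame_edit(w_identity, p) for p in frame_params]
```

Storing a video as one $18 \times 512$ latent plus $35$ floats per frame is what makes the representation data-efficient: per frame it is over two orders of magnitude smaller than re-inverting each frame into its own $W+$ latent.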