Production-level workflows for producing convincing 3D dynamic human faces have long relied on an assortment of labor-intensive tools for geometry and texture generation, motion capture and rigging, and expression synthesis. Recent neural approaches automate individual components but the corresponding latent representations cannot provide artists with explicit controls as in conventional tools. In this paper, we present a new learning-based, video-driven approach for generating dynamic facial geometries with high-quality physically-based assets. For data collection, we construct a hybrid multiview-photometric capture stage, coupling with ultra-fast video cameras to obtain raw 3D facial assets. We then set out to model the facial expression, geometry and physically-based textures using separate VAEs where we impose a global MLP based expression mapping across the latent spaces of respective networks, to preserve characteristics across respective attributes. We also model the delta information as wrinkle maps for the physically-based textures, achieving high-quality 4K dynamic textures. We demonstrate our approach in high-fidelity performer-specific facial capture and cross-identity facial motion retargeting. In addition, our multi-VAE-based neural asset, along with the fast adaptation schemes, can also be deployed to handle in-the-wild videos. Besides, we motivate the utility of our explicit facial disentangling strategy by providing various promising physically-based editing results with high realism. Comprehensive experiments show that our technique provides higher accuracy and visual fidelity than previous video-driven facial reconstruction and animation methods.
翻译:制作令人信服的 3D 动态人类面孔的生产水平工作流程长期以来依赖于各种劳动密集型工具的组合,这些工具用于几何和纹理生成、运动捕捉和操纵,以及表达合成。最近神经对个体组件采取自动处理方法,但相应的潜在代表无法像常规工具那样为艺术家提供明确的控制。在本文中,我们展示了一种新的基于学习的、视频驱动的方法,用高质量的物理资产生成动态面部偏差。在数据收集方面,我们构建了一个混合多视光度抓取阶段,与超快视频相机联在一起,以获取原始的 3D 面部资产。然后我们开始用不同的 VAEE 来模拟面部表达、几何和基于物理的纹理的纹理。此外,我们还可以将基于全球 MLP 的表达图像绘制出一个跨越各个网络潜在空间的清晰的图像图,以保存各个属性的特征。我们还将三角图信息建模成为基于物理的纹理的微缩图,实现高质量的 4K 动态质谱。我们展示了高端度的直径直径的面镜拍摄和跨直径直径直径直径的面图,我们的方法可以提供直观的直观的直观的直径图像的图像再展示。