以实物为基础的用于生产的经济资产 (Video-driven Neural Physically-based Facial Asset for Production)

from arxiv, For project page, see https://sites.google.com/view/npfa/ Notice: You may not copy, reproduce, distribute, publish, display, perform, modify, create derivative works, transmit, or in any way exploit any such content, nor may you distribute any part of this content over any network, including a local area network, sell or offer it for sale, or use such content to construct any kind of database

Production-level workflows for producing convincing 3D dynamic human faces have long relied on an assortment of labor-intensive tools for geometry and texture generation, motion capture and rigging, and expression synthesis. Recent neural approaches automate individual components but the corresponding latent representations cannot provide artists with explicit controls as in conventional tools. In this paper, we present a new learning-based, video-driven approach for generating dynamic facial geometries with high-quality physically-based assets. For data collection, we construct a hybrid multiview-photometric capture stage, coupling with ultra-fast video cameras to obtain raw 3D facial assets. We then set out to model the facial expression, geometry and physically-based textures using separate VAEs where we impose a global MLP based expression mapping across the latent spaces of respective networks, to preserve characteristics across respective attributes. We also model the delta information as wrinkle maps for the physically-based textures, achieving high-quality 4K dynamic textures. We demonstrate our approach in high-fidelity performer-specific facial capture and cross-identity facial motion retargeting. In addition, our multi-VAE-based neural asset, along with the fast adaptation schemes, can also be deployed to handle in-the-wild videos. Besides, we motivate the utility of our explicit facial disentangling strategy by providing various promising physically-based editing results with high realism. Comprehensive experiments show that our technique provides higher accuracy and visual fidelity than previous video-driven facial reconstruction and animation methods.

翻译：制作令人信服的 3D 动态人类面孔的生产水平工作流程长期以来依赖于各种劳动密集型工具的组合,这些工具用于几何和纹理生成、运动捕捉和操纵,以及表达合成。最近神经对个体组件采取自动处理方法,但相应的潜在代表无法像常规工具那样为艺术家提供明确的控制。在本文中,我们展示了一种新的基于学习的、视频驱动的方法,用高质量的物理资产生成动态面部偏差。在数据收集方面,我们构建了一个混合多视光度抓取阶段,与超快视频相机联在一起,以获取原始的 3D 面部资产。然后我们开始用不同的 VAEE 来模拟面部表达、几何和基于物理的纹理的纹理。此外,我们还可以将基于全球 MLP 的表达图像绘制出一个跨越各个网络潜在空间的清晰的图像图,以保存各个属性的特征。我们还将三角图信息建模成为基于物理的纹理的微缩图,实现高质量的 4K 动态质谱。我们展示了高端度的直径直径的面镜拍摄和跨直径直径直径直径的面图,我们的方法可以提供直观的直观的直观的直径图像的图像再展示。

相关内容

ASSETS

关注 0

ACM SIGACCESS Conference on Computers and Accessibility是为残疾人和老年人提供与计算机相关的设计、评估、使用和教育研究的首要论坛。我们欢迎提交原始的高质量的有关计算和可访问性的主题。今年，ASSETS首次将其范围扩大到包括关于计算机无障碍教育相关主题的原创高质量研究。官网链接：http://assets19.sigaccess.org/

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

【图神经网络实用介绍】A practical introduction to GNNs - Part 1

专知会员服务

28+阅读 · 2022年2月28日

计算机科学课程与视频课件合集，Computer Science courses with video lectures

专知会员服务

37+阅读 · 2022年1月24日

剑桥大学《数据科学: 原理与实践》课程，附PPT下载

专知会员服务

54+阅读 · 2021年1月20日