MeshTalk: 3D 使用交叉方式分裂的演讲的面部动画 (MeshTalk: 3D Face Animation from Speech using Cross-Modality Disentanglement) - 专知论文

会员服务 ·

0

state-of-the-art · 3D · INFORMS · 讲稿 · CASES ·

2022 年 5 月 20 日

MeshTalk: 3D Face Animation from Speech using Cross-Modality Disentanglement

翻译：MeshTalk: 3D 使用交叉方式分裂的演讲的面部动画

Alexander Richard,Michael Zollhoefer,Yandong Wen,Fernando de la Torre,Yaser Sheikh

from arxiv, updated link to github repository and supplemental video

This paper presents a generic method for generating full facial 3D animation from speech. Existing approaches to audio-driven facial animation exhibit uncanny or static upper face animation, fail to produce accurate and plausible co-articulation or rely on person-specific models that limit their scalability. To improve upon existing models, we propose a generic audio-driven facial animation approach that achieves highly realistic motion synthesis results for the entire face. At the core of our approach is a categorical latent space for facial animation that disentangles audio-correlated and audio-uncorrelated information based on a novel cross-modality loss. Our approach ensures highly accurate lip motion, while also synthesizing plausible animation of the parts of the face that are uncorrelated to the audio signal, such as eye blinks and eye brow motion. We demonstrate that our approach outperforms several baselines and obtains state-of-the-art quality both qualitatively and quantitatively. A perceptual user study demonstrates that our approach is deemed more realistic than the current state-of-the-art in over 75% of cases. We recommend watching the supplemental video before reading the paper: https://github.com/facebookresearch/meshtalk

翻译：本文介绍了一种通用方法,用于从言语中生成完整的面部 3D 动画; 现有的由声音驱动的面部动动画展示出不光彩或静态的上脸动画,未能产生准确和可信的共同演示,或依赖限制其可缩放性的个人特有模型。为了改进现有的模型,我们提议了一种由声音驱动的面部动动画通用方法,为整个脸部取得高度现实的动作合成结果。我们的方法核心是面部动动画的绝对潜在空间,它分解了以新颖的跨时尚损失为基础的与声音相关和与声音不相容的信息。我们的方法确保了高度准确的嘴部运动,同时还合成了与声音信号不相容的面部分的貌似动画,例如眨眼和眼眉毛运动。我们证明我们的方法超越了几个基线,并获得了质量和数量两方面的状态。一种概念用户研究表明,我们的方法被认为比超过75%的案例中的当前状态更为现实。我们建议在阅读论文之前的辅助性视频: http://gistrabstalbly。

0

相关内容

state-of-the-art

state-of-the-art

ICLR 2021杰出论文奖出炉，8篇论文上榜！

专知会员服务

26+阅读 · 2021年4月2日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

165+阅读 · 2020年3月18日

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【跟踪Tracking】15篇论文+代码 | 中秋快乐~

【跟踪Tracking】15篇论文+代码 | 中秋快乐~

专知

18+阅读 · 2018年9月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

专知

23+阅读 · 2018年1月18日

Faecalibacterium prausnitzii协同LFA-1在炎症性肠病发生中调控淋巴细胞分化及功能的作用机制

国家自然科学基金

0+阅读 · 2014年12月31日

Calderon问题和边界刚性问题

国家自然科学基金

0+阅读 · 2013年12月31日

国家天文台科研亮点基金成果多媒体展示和交互体验

国家自然科学基金

1+阅读 · 2013年12月31日

多元线性整值时间序列的统计分析

国家自然科学基金

2+阅读 · 2013年12月31日

隐伏矿弱缓异常识别与奇异性地质统计学建模

国家自然科学基金

0+阅读 · 2012年12月31日

实时安全关键系统的建模、仿真与验证

国家自然科学基金

1+阅读 · 2012年12月31日

4d、5d关联金属氧化物制备与多重量子序研究

国家自然科学基金

0+阅读 · 2011年12月31日

附睾蛋白酶抑制剂(EPPIN)基因转录调控的分子机理

国家自然科学基金

0+阅读 · 2009年12月31日

自适应光学在人眼微视野缺损评价中的应用研究

国家自然科学基金

0+阅读 · 2008年12月31日

基于图像处理和重建的Radon型广义变换及其关键技术研究

国家自然科学基金

0+阅读 · 2008年12月31日

Conditioning of a Hybrid High-Order scheme on meshes with small faces

Arxiv

0+阅读 · 2022年7月8日

A Novel Unified Conditional Score-based Generative Framework for Multi-modal Medical Image Completion

A Novel Unified Conditional Score-based Generative Framework for Multi-modal Medical Image Completion

Arxiv

0+阅读 · 2022年7月7日

BMD-GAN: Bone mineral density estimation using x-ray image decomposition into projections of bone-segmented quantitative computed tomography using hierarchical learning

BMD-GAN: Bone mineral density estimation using x-ray image decomposition into projections of bone-segmented quantitative computed tomography using hierarchical learning

Arxiv

0+阅读 · 2022年7月7日

Expression-preserving face frontalization improves visually assisted speech processing

Arxiv

0+阅读 · 2022年7月6日

Physical Interaction and Manipulation of the Environment using Aerial Robots

Arxiv

0+阅读 · 2022年7月6日

From 2D Images to 3D Model:Weakly Supervised Multi-View Face Reconstruction with Deep Fusion

Arxiv

0+阅读 · 2022年7月6日

Latents2Segments: Disentangling the Latent Space of Generative Models for Semantic Segmentation of Face Images

Latents2Segments: Disentangling the Latent Space of Generative Models for Semantic Segmentation of Face Images

Arxiv

0+阅读 · 2022年7月6日

From Show to Tell: A Survey on Image Captioning

Arxiv

15+阅读 · 2021年7月14日

SDD-FIQA: Unsupervised Face Image Quality Assessment with Similarity Distribution Distance

Arxiv

13+阅读 · 2021年3月10日

Diverse Image-to-Image Translation via Disentangled Representations

Diverse Image-to-Image Translation via Disentangled Representations

Arxiv

13+阅读 · 2018年8月2日

VIP会员

文章信息

相关主题

state-of-the-art

相关VIP内容

ICLR 2021杰出论文奖出炉，8篇论文上榜！

专知会员服务

26+阅读 · 2021年4月2日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

165+阅读 · 2020年3月18日

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

操作系统智能体：基于多模态大模型（MLLM）的通用计算设备智能体综述

《美国太空军系统全生命周期建模、仿真与分析效能提升方案》最新84页报告

【博士论文】推进数据高效的深度学习：非参数 Transformer、主动测试与上下文学习

自主人工智能：未来战争是否将是自主化的？

相关资讯

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【跟踪Tracking】15篇论文+代码 | 中秋快乐~

【跟踪Tracking】15篇论文+代码 | 中秋快乐~

专知

18+阅读 · 2018年9月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

专知

23+阅读 · 2018年1月18日

相关论文

Conditioning of a Hybrid High-Order scheme on meshes with small faces

Arxiv

0+阅读 · 2022年7月8日

A Novel Unified Conditional Score-based Generative Framework for Multi-modal Medical Image Completion

A Novel Unified Conditional Score-based Generative Framework for Multi-modal Medical Image Completion

Arxiv

0+阅读 · 2022年7月7日

BMD-GAN: Bone mineral density estimation using x-ray image decomposition into projections of bone-segmented quantitative computed tomography using hierarchical learning

BMD-GAN: Bone mineral density estimation using x-ray image decomposition into projections of bone-segmented quantitative computed tomography using hierarchical learning

Arxiv

0+阅读 · 2022年7月7日

Expression-preserving face frontalization improves visually assisted speech processing

Arxiv

0+阅读 · 2022年7月6日

Physical Interaction and Manipulation of the Environment using Aerial Robots

Arxiv

0+阅读 · 2022年7月6日

From 2D Images to 3D Model:Weakly Supervised Multi-View Face Reconstruction with Deep Fusion

Arxiv

0+阅读 · 2022年7月6日

Latents2Segments: Disentangling the Latent Space of Generative Models for Semantic Segmentation of Face Images

Latents2Segments: Disentangling the Latent Space of Generative Models for Semantic Segmentation of Face Images

Arxiv

0+阅读 · 2022年7月6日

From Show to Tell: A Survey on Image Captioning

Arxiv

15+阅读 · 2021年7月14日

SDD-FIQA: Unsupervised Face Image Quality Assessment with Similarity Distribution Distance

Arxiv

13+阅读 · 2021年3月10日

Diverse Image-to-Image Translation via Disentangled Representations

Diverse Image-to-Image Translation via Disentangled Representations

Arxiv

13+阅读 · 2018年8月2日

相关基金

Faecalibacterium prausnitzii协同LFA-1在炎症性肠病发生中调控淋巴细胞分化及功能的作用机制

国家自然科学基金

0+阅读 · 2014年12月31日

Calderon问题和边界刚性问题

国家自然科学基金

0+阅读 · 2013年12月31日

国家天文台科研亮点基金成果多媒体展示和交互体验

国家自然科学基金

1+阅读 · 2013年12月31日

多元线性整值时间序列的统计分析

国家自然科学基金

2+阅读 · 2013年12月31日

隐伏矿弱缓异常识别与奇异性地质统计学建模

国家自然科学基金

0+阅读 · 2012年12月31日

实时安全关键系统的建模、仿真与验证

国家自然科学基金

1+阅读 · 2012年12月31日

4d、5d关联金属氧化物制备与多重量子序研究

国家自然科学基金

0+阅读 · 2011年12月31日

附睾蛋白酶抑制剂(EPPIN)基因转录调控的分子机理

国家自然科学基金

0+阅读 · 2009年12月31日

自适应光学在人眼微视野缺损评价中的应用研究

国家自然科学基金

0+阅读 · 2008年12月31日

基于图像处理和重建的Radon型广义变换及其关键技术研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员