Geneface: 通用和高清晰度的音频驱动器 3D 语音面孔合成 (GeneFace: Generalized and High-Fidelity Audio-Driven 3D Talking Face Synthesis) - 专知论文

会员服务 ·

0

3D · Learning · Extensibility · 逼真度 · 分离的 ·

2023 年 1 月 31 日

GeneFace: Generalized and High-Fidelity Audio-Driven 3D Talking Face Synthesis

翻译：Geneface: 通用和高清晰度的音频驱动器 3D 语音面孔合成

Zhenhui Ye,Ziyue Jiang,Yi Ren,Jinglin Liu,JinZheng He,Zhou Zhao

from arxiv, Accepted by ICLR2023. Project page: https://geneface.github.io/

Generating photo-realistic video portrait with arbitrary speech audio is a crucial problem in film-making and virtual reality. Recently, several works explore the usage of neural radiance field in this task to improve 3D realness and image fidelity. However, the generalizability of previous NeRF-based methods to out-of-domain audio is limited by the small scale of training data. In this work, we propose GeneFace, a generalized and high-fidelity NeRF-based talking face generation method, which can generate natural results corresponding to various out-of-domain audio. Specifically, we learn a variaitional motion generator on a large lip-reading corpus, and introduce a domain adaptative post-net to calibrate the result. Moreover, we learn a NeRF-based renderer conditioned on the predicted facial motion. A head-aware torso-NeRF is proposed to eliminate the head-torso separation problem. Extensive experiments show that our method achieves more generalized and high-fidelity talking face generation compared to previous methods.

翻译：在制作电影和虚拟现实时,一个至关重要的问题。最近,一些作品探索了在这一任务中使用神经光亮场,以提高3D真实性和图像忠诚性。然而,以前基于NeRF的音频外出音频方法的通用性受到培训数据规模小的限制。在这项工作中,我们提议GeneFace,这是一个普遍和高度不理解的NERF语访谈面部生成方法,可以产生与各种外出音频相匹配的自然结果。具体地说,我们在大型读唇机上学习了一种液态运动生成器,并引入了一种对结果进行校准的域性调整后网络。此外,我们学习了以预测的面部动作为条件的基于NERF的变音器。我们提议了一种以头部和高侧面部分离为主的图像生成法,以消除头部和高侧面部分离问题。广泛的实验表明,我们的方法比以往的方法更加普及和高度和高侧面部对话生成。

2

相关内容

3D是英文“Three Dimensions”的简称，中文是指三维、三个维度、三个坐标，即有长、有宽、有高，换句话说，就是立体的，是相对于只有长和宽的平面（2D）而言。

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

CVPR 2020 论文开源项目合集

专知会员服务

110+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

【论文推荐】最新四篇CVPR2018 视频描述生成相关论文—双向注意力、Transformer、重构网络、层次强化学习

【论文推荐】最新四篇CVPR2018 视频描述生成相关论文—双向注意力、Transformer、重构网络、层次强化学习

专知

31+阅读 · 2018年6月4日

【论文推荐】最新八篇生成对抗网络相关论文—BRE、图像合成、多模态图像生成、非配对多域图、注意力、对抗特征增强、深度对抗性训练

【论文推荐】最新八篇生成对抗网络相关论文—BRE、图像合成、多模态图像生成、非配对多域图、注意力、对抗特征增强、深度对抗性训练

专知

16+阅读 · 2018年5月14日

【论文推荐】最新六篇视频分类相关论文—层次标签推断、知识图谱、CNNs、DAiSEE、表观和关系网络、转移学习

【论文推荐】最新六篇视频分类相关论文—层次标签推断、知识图谱、CNNs、DAiSEE、表观和关系网络、转移学习

专知

13+阅读 · 2018年2月18日

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

专知

23+阅读 · 2018年1月18日

Massive MIMO 系统中接收端低复杂度检测技术研究

国家自然科学基金

0+阅读 · 2015年12月31日

肠-肝胆汁酸感受在胃旁路术后早期胰岛素敏感性改善中的作用及机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

三维互联网应用中的服饰实时动画关键技术研究

国家自然科学基金

0+阅读 · 2012年12月31日

母体变应性鼻炎对子代Foxp3DNA甲基化的影响及干预

国家自然科学基金

0+阅读 · 2012年12月31日

糖基化修饰大豆蛋白过敏原的调控机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

RGM与neogenin信号调控应激性精神障碍-PTSD杏仁核、海马神经细胞凋亡的分子机制

国家自然科学基金

0+阅读 · 2011年12月31日

基于Decorin基因甲基化调控的非小细胞肺癌转移的分子机制

国家自然科学基金

0+阅读 · 2011年12月31日

基于list-mode数据的快速SART真3D PET断层重建算法的研究

国家自然科学基金

0+阅读 · 2011年12月31日

高光度blazar的甚高能伽马射线辐射研究

国家自然科学基金

0+阅读 · 2009年12月31日

基于金属纳米粒子的超高磁响应性复合微球的制备研究

国家自然科学基金

0+阅读 · 2009年12月31日

EPro-PnP: Generalized End-to-End Probabilistic Perspective-n-Points for Monocular Object Pose Estimation

Arxiv

0+阅读 · 2023年3月22日

AeDet: Azimuth-invariant Multi-view 3D Object Detection

Arxiv

0+阅读 · 2023年3月22日

PartNeRF: Generating Part-Aware Editable 3D Shapes without 3D Supervision

Arxiv

0+阅读 · 2023年3月21日

Less is More: Reducing Task and Model Complexity for 3D Point Cloud Semantic Segmentation

Arxiv

0+阅读 · 2023年3月20日

EmoTalk: Speech-driven emotional disentanglement for 3D face animation

Arxiv

0+阅读 · 2023年3月20日

Taming Diffusion Models for Audio-Driven Co-Speech Gesture Generation

Arxiv

0+阅读 · 2023年3月18日

MODIFY: Model-driven Face Stylization without Style Images

Arxiv

0+阅读 · 2023年3月17日

Style Transfer for 2D Talking Head Animation

Arxiv

0+阅读 · 2023年3月17日

MMFace4D: A Large-Scale Multi-Modal 4D Face Dataset for Audio-Driven 3D Face Animation

Arxiv

1+阅读 · 2023年3月17日

DiGeo: Discriminative Geometry-Aware Learning for Generalized Few-Shot Object Detection

Arxiv

0+阅读 · 2023年3月16日

VIP会员

文章信息

相关主题

相关VIP内容

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

CVPR 2020 论文开源项目合集

专知会员服务

110+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《复杂工程系统模型驱动设计决策支持系统：早期设计阶段挑战》最新138页

《日本陆上自卫队2040年作战方式与未来作战研究》最新23页slides

人工智能作为战争武器

《后勤保障》最新23页

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

【论文推荐】最新四篇CVPR2018 视频描述生成相关论文—双向注意力、Transformer、重构网络、层次强化学习

【论文推荐】最新四篇CVPR2018 视频描述生成相关论文—双向注意力、Transformer、重构网络、层次强化学习

专知

31+阅读 · 2018年6月4日

【论文推荐】最新八篇生成对抗网络相关论文—BRE、图像合成、多模态图像生成、非配对多域图、注意力、对抗特征增强、深度对抗性训练

【论文推荐】最新八篇生成对抗网络相关论文—BRE、图像合成、多模态图像生成、非配对多域图、注意力、对抗特征增强、深度对抗性训练

专知

16+阅读 · 2018年5月14日

【论文推荐】最新六篇视频分类相关论文—层次标签推断、知识图谱、CNNs、DAiSEE、表观和关系网络、转移学习

【论文推荐】最新六篇视频分类相关论文—层次标签推断、知识图谱、CNNs、DAiSEE、表观和关系网络、转移学习

专知

13+阅读 · 2018年2月18日

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

专知

23+阅读 · 2018年1月18日

相关论文

EPro-PnP: Generalized End-to-End Probabilistic Perspective-n-Points for Monocular Object Pose Estimation

Arxiv

0+阅读 · 2023年3月22日

AeDet: Azimuth-invariant Multi-view 3D Object Detection

Arxiv

0+阅读 · 2023年3月22日

PartNeRF: Generating Part-Aware Editable 3D Shapes without 3D Supervision

Arxiv

0+阅读 · 2023年3月21日

Less is More: Reducing Task and Model Complexity for 3D Point Cloud Semantic Segmentation

Arxiv

0+阅读 · 2023年3月20日

EmoTalk: Speech-driven emotional disentanglement for 3D face animation

Arxiv

0+阅读 · 2023年3月20日

Taming Diffusion Models for Audio-Driven Co-Speech Gesture Generation

Arxiv

0+阅读 · 2023年3月18日

MODIFY: Model-driven Face Stylization without Style Images

Arxiv

0+阅读 · 2023年3月17日

Style Transfer for 2D Talking Head Animation

Arxiv

0+阅读 · 2023年3月17日

MMFace4D: A Large-Scale Multi-Modal 4D Face Dataset for Audio-Driven 3D Face Animation

Arxiv

1+阅读 · 2023年3月17日

DiGeo: Discriminative Geometry-Aware Learning for Generalized Few-Shot Object Detection

Arxiv

0+阅读 · 2023年3月16日

相关基金

Massive MIMO 系统中接收端低复杂度检测技术研究

国家自然科学基金

0+阅读 · 2015年12月31日

肠-肝胆汁酸感受在胃旁路术后早期胰岛素敏感性改善中的作用及机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

三维互联网应用中的服饰实时动画关键技术研究

国家自然科学基金

0+阅读 · 2012年12月31日

母体变应性鼻炎对子代Foxp3DNA甲基化的影响及干预

国家自然科学基金

0+阅读 · 2012年12月31日

糖基化修饰大豆蛋白过敏原的调控机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

RGM与neogenin信号调控应激性精神障碍-PTSD杏仁核、海马神经细胞凋亡的分子机制

国家自然科学基金

0+阅读 · 2011年12月31日

基于Decorin基因甲基化调控的非小细胞肺癌转移的分子机制

国家自然科学基金

0+阅读 · 2011年12月31日

基于list-mode数据的快速SART真3D PET断层重建算法的研究

国家自然科学基金

0+阅读 · 2011年12月31日

高光度blazar的甚高能伽马射线辐射研究

国家自然科学基金

0+阅读 · 2009年12月31日

基于金属纳米粒子的超高磁响应性复合微球的制备研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员