Project Title: Speech- and Emotional-Semantics-Synchronized 3D Facial Visualization: From Articulators to Appearance
Project No.: 61472393
Project Type: General Program (面上项目)
Year Approved: 2015
Discipline: Computer Science
Principal Investigator: Zengfu Wang (汪增福)
Affiliation: Hefei Institutes of Physical Science, Chinese Academy of Sciences
Funding Amount: 800,000 CNY
Abstract (translated from Chinese): Starting from the problem of multimodal human-machine interaction, this project systematically investigates speech- and emotional-semantics-synchronized 3D facial visualization. The overall research goals are as follows: making full use of multiple means of acquiring articulation data, including magnetic resonance imaging (MRI), electromagnetic articulography (EMA), and X-ray imaging, we will design and implement a 3D facial animation synthesis scheme driven by text and/or speech input, and build a real-time, highly natural 3D emotional facial animation synthesis system that is synchronized with speech and emotional semantics and can display the articulation process from the inside out. To resolve the conflicts that arise during system implementation between realizability and high naturalness, and between computational complexity and real-time performance, we will conduct in-depth research, from a systems perspective, on multi-source articulation data fusion, 3D-model-based facial animation synthesis, 3D articulator motion modeling, and modeling of the coordination between articulators and speech. The resulting key techniques will serve as the building blocks of a vivid 3D speech visualization system that renders sound and image together, laying the foundation for moving this research toward practical application.
Keywords (Chinese): Virtual Reality; Facial Animation; Visualization
Abstract (English): This project focuses on the problem of multimodal human-machine interaction. We will conduct research on speech- and emotional-semantics-synchronized 3D facial visualization, with the following goals: by making full use of multiple acquisition modalities for articulation-related information, including magnetic resonance imaging (MRI), electromagnetic articulography (EMA), and X-ray imaging, we will develop a facial animation generation scheme driven by text and/or speech, and construct a highly realistic, speech- and emotional-semantics-synchronized 3D facial visualization system that runs in real time and shows the detailed dynamic process of pronunciation, from the internal articulators to the external appearance. To resolve the conflicts between realizability and high naturalness, and between computational complexity and real-time performance, that arise during system implementation, we will address problems including sensor data fusion across multiple articulators, facial animation based on a 3D head model, 3D dynamic modeling of articulators, and modeling of the cooperative relation between articulators and speech. We will develop the corresponding key techniques, use them to construct a vivid speech- and emotional-semantics-synchronized 3D facial visualization system, and thereby provide a concrete foundation for applications.
Keywords (English): Virtual Reality; Facial Animation; Visualization