DiffVoice: Text-to-Speech with Latent Diffusion

April 23, 2023

Zhijun Liu, Yiwei Guo, Kai Yu

From arXiv; accepted to ICASSP 2023

In this work, we present DiffVoice, a novel text-to-speech model based on latent diffusion. We propose to first encode speech signals into a phoneme-rate latent representation with a variational autoencoder enhanced by adversarial training, and then to jointly model the duration and the latent representation with a diffusion model. Subjective evaluations on the LJSpeech and LibriTTS datasets demonstrate that our method outperforms the best publicly available systems in naturalness. By adopting recent generative inverse-problem-solving algorithms for diffusion models, DiffVoice achieves state-of-the-art performance in text-based speech editing and zero-shot adaptation.
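
To make the notion of a phoneme-rate representation concrete, the sketch below pools frame-level feature vectors into one vector per phoneme and then expands them back to frame rate using per-phoneme durations. It is only an illustration under simplifying assumptions: DiffVoice learns its latent representation with a variational autoencoder, whereas plain averaging and repetition stand in for the learned model here, and the function names `pool_to_phoneme_rate` and `expand_to_frame_rate` are hypothetical.

```python
from typing import List

Vector = List[float]


def pool_to_phoneme_rate(frames: List[Vector], durations: List[int]) -> List[Vector]:
    """Average consecutive frame vectors so that each phoneme gets one vector.

    Assumption: simple averaging replaces the learned VAE encoder.
    durations[i] is the number of frames that belong to phoneme i.
    """
    if sum(durations) != len(frames):
        raise ValueError("durations must sum to the number of frames")
    pooled: List[Vector] = []
    start = 0
    for d in durations:
        if d <= 0:
            raise ValueError("each duration must be a positive integer")
        segment = frames[start:start + d]
        dim = len(segment[0])
        # Element-wise mean over the frames of this phoneme.
        pooled.append([sum(frame[i] for frame in segment) / d for i in range(dim)])
        start += d
    return pooled


def expand_to_frame_rate(latents: List[Vector], durations: List[int]) -> List[Vector]:
    """Repeat each phoneme-rate vector for as many frames as its duration."""
    if len(latents) != len(durations):
        raise ValueError("need exactly one duration per phoneme-rate vector")
    expanded: List[Vector] = []
    for z, d in zip(latents, durations):
        expanded.extend(list(z) for _ in range(d))
    return expanded


# Three frames, two phonemes: the first phoneme spans two frames, the second one.
frames = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
durations = [2, 1]
pooled = pool_to_phoneme_rate(frames, durations)
expanded = expand_to_frame_rate(pooled, durations)
```

The sequence at phoneme rate is much shorter than the frame sequence, and the durations are what tie the two rates together, which is one way to read the abstract's choice to model the duration and the latent representation jointly.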
