指导-TTS:通过分类指南进行文本到语音的传播模型 (Guided-TTS: A Diffusion Model for Text-to-Speech via Classifier Guidance) - 专知论文

会员服务 ·

0

Guidance · 语音合成 · MoDELS · Performer · 音素 ·

2022 年 1 月 29 日

Guided-TTS: A Diffusion Model for Text-to-Speech via Classifier Guidance

翻译：指导-TTS:通过分类指南进行文本到语音的传播模型

Heeseung Kim,Sungwon Kim,Sungroh Yoon

We propose Guided-TTS, a high-quality text-to-speech (TTS) model that does not require any transcript of target speaker using classifier guidance. Guided-TTS combines an unconditional diffusion probabilistic model with a separately trained phoneme classifier for classifier guidance. Our unconditional diffusion model learns to generate speech without any context from untranscribed speech data. For TTS synthesis, we guide the generative process of the diffusion model with a phoneme classifier trained on a large-scale speech recognition dataset. We present a norm-based scaling method that reduces the pronunciation errors of classifier guidance in Guided-TTS. We show that Guided-TTS achieves a performance comparable to that of the state-of-the-art TTS model, Grad-TTS, without any transcript for LJSpeech. We further demonstrate that Guided-TTS performs well on diverse datasets including a long-form untranscribed dataset.

翻译：我们建议采用导引-TTS(TTS)模式,这是一个不需要使用分类指导的目标演讲者笔录的高质量文本到语音(TTS)模式。指导-TTS将无条件的传播概率模型与单独训练的分类指导的语音分类器相结合。我们无条件的传播模型学会在没有任何未经调试的语音数据背景的情况下生成语音。对于 TTS 合成, 我们用在大型语音识别数据集上受过培训的语音分类器指导扩散模型的基因化过程。我们提出了一个基于规范的缩放方法, 减少指导- TTS 中分类者指南的发音错误。我们显示, 指导- TTS 取得了与最先进的 TTS 模型( Grad-TTS) 相似的性能, 但没有为 LJSpeech 提供任何记录。我们进一步证明, 指导-TTS 在多种数据集上表现良好, 包括一个长式的未调制数据集。

0

相关内容

Guidance

【PAISS 2021 教程】概率散度与生成式模型，92页ppt

【PAISS 2021 教程】概率散度与生成式模型，92页ppt

专知会员服务

34+阅读 · 2021年11月30日

最新《Transformers模型》教程，64页ppt

最新《Transformers模型》教程，64页ppt

专知会员服务

324+阅读 · 2020年11月26日

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

45+阅读 · 2020年10月31日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

零样本文本分类，Zero-Shot Learning for Text Classification

零样本文本分类，Zero-Shot Learning for Text Classification

专知会员服务

97+阅读 · 2020年5月31日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

中国图象图形学学会CSIG

0+阅读 · 2021年12月17日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

vae 相关论文表示学习 1

vae 相关论文表示学习 1

CreateAMind

12+阅读 · 2018年9月6日

【SIGIR2018】五篇对抗训练文章

【SIGIR2018】五篇对抗训练文章

专知

12+阅读 · 2018年7月9日

【论文推荐】最新八篇情感分析相关论文—Pair-wise判别器、多模态情感分析、上下文语境、Gated 卷积网络

【论文推荐】最新八篇情感分析相关论文—Pair-wise判别器、多模态情感分析、上下文语境、Gated 卷积网络

专知

20+阅读 · 2018年6月29日

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

专知

23+阅读 · 2018年1月18日

基于血脑PK-PD和结构方程模型的银杏叶提取物多组分协同抗脑缺血作用研究

国家自然科学基金

0+阅读 · 2015年12月31日

GSK-3β在造影剂致肾小管上皮细胞损伤中的作用及机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

氧化石墨烯脂质体的制备及其在化学发光生物分析中的应用

国家自然科学基金

0+阅读 · 2014年12月31日

南极表层海水低温微生物筛选及其低温适应机制探讨

国家自然科学基金

1+阅读 · 2014年12月31日

TTMB赋能干预模式对慢性肾病血液透析患者自我管理和身心健康的效果和机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

调节性B细胞在HIV-1慢性感染中的作用机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

中国东部壳斗科植物潜叶昆虫多样性的纬度梯度分布格局

国家自然科学基金

0+阅读 · 2012年12月31日

IL-17A在肺炎链球菌急性中耳炎黏膜免疫保护中的作用

国家自然科学基金

0+阅读 · 2012年12月31日

随机微分方程概周期解和遍历解

国家自然科学基金

4+阅读 · 2011年12月31日

Cystatin B缺失与Prion疾病自噬作用机制的研究

国家自然科学基金

0+阅读 · 2011年12月31日

An Energy-Based Prior for Generative Saliency

Arxiv

0+阅读 · 2022年4月19日

Imbalanced Classification via a Tabular Translation GAN

Arxiv

0+阅读 · 2022年4月19日

Span Classification with Structured Information for Disfluency Detection in Spoken Utterances

Arxiv

0+阅读 · 2022年4月18日

medXGAN: Visual Explanations for Medical Classifiers through a Generative Latent Space

Arxiv

0+阅读 · 2022年4月17日

Soft Truncation: A Universal Training Technique of Score-based Diffusion Model for High Precision Score Estimation

Arxiv

0+阅读 · 2022年4月15日

More Control for Free! Image Synthesis with Semantic Diffusion Guidance

Arxiv

1+阅读 · 2022年4月14日

Cross-Domain Few-Shot Graph Classification

Arxiv

13+阅读 · 2022年1月20日

Self-Supervised Learning For Few-Shot Image Classification

Self-Supervised Learning For Few-Shot Image Classification

Arxiv

19+阅读 · 2019年11月14日

Generative Adversarial Networks and Probabilistic Graph Models for Hyperspectral Image Classification

Arxiv

11+阅读 · 2018年2月10日

Pose-Normalized Image Generation for Person Re-identification

Arxiv

11+阅读 · 2018年1月18日

VIP会员

文章信息

相关主题

相关VIP内容

【PAISS 2021 教程】概率散度与生成式模型，92页ppt

【PAISS 2021 教程】概率散度与生成式模型，92页ppt

专知会员服务

34+阅读 · 2021年11月30日

最新《Transformers模型》教程，64页ppt

最新《Transformers模型》教程，64页ppt

专知会员服务

324+阅读 · 2020年11月26日

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

45+阅读 · 2020年10月31日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

零样本文本分类，Zero-Shot Learning for Text Classification

零样本文本分类，Zero-Shot Learning for Text Classification

专知会员服务

97+阅读 · 2020年5月31日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

热门VIP内容

开通专知VIP会员享更多权益服务

《电磁环境模拟：弥补太空领域作战训练缺口》

Nature 子刊 | SciToolAgent:知识图谱引导的科学工具智能体

《军事行动中的人机AI编队本体模型》

《面向军事网络的下一代云事件响应》

相关资讯

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

中国图象图形学学会CSIG

0+阅读 · 2021年12月17日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

vae 相关论文表示学习 1

vae 相关论文表示学习 1

CreateAMind

12+阅读 · 2018年9月6日

【SIGIR2018】五篇对抗训练文章

【SIGIR2018】五篇对抗训练文章

专知

12+阅读 · 2018年7月9日

【论文推荐】最新八篇情感分析相关论文—Pair-wise判别器、多模态情感分析、上下文语境、Gated 卷积网络

【论文推荐】最新八篇情感分析相关论文—Pair-wise判别器、多模态情感分析、上下文语境、Gated 卷积网络

专知

20+阅读 · 2018年6月29日

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

专知

23+阅读 · 2018年1月18日

相关论文

An Energy-Based Prior for Generative Saliency

Arxiv

0+阅读 · 2022年4月19日

Imbalanced Classification via a Tabular Translation GAN

Arxiv

0+阅读 · 2022年4月19日

Span Classification with Structured Information for Disfluency Detection in Spoken Utterances

Arxiv

0+阅读 · 2022年4月18日

medXGAN: Visual Explanations for Medical Classifiers through a Generative Latent Space

Arxiv

0+阅读 · 2022年4月17日

Soft Truncation: A Universal Training Technique of Score-based Diffusion Model for High Precision Score Estimation

Arxiv

0+阅读 · 2022年4月15日

More Control for Free! Image Synthesis with Semantic Diffusion Guidance

Arxiv

1+阅读 · 2022年4月14日

Cross-Domain Few-Shot Graph Classification

Arxiv

13+阅读 · 2022年1月20日

Self-Supervised Learning For Few-Shot Image Classification

Self-Supervised Learning For Few-Shot Image Classification

Arxiv

19+阅读 · 2019年11月14日

Generative Adversarial Networks and Probabilistic Graph Models for Hyperspectral Image Classification

Arxiv

11+阅读 · 2018年2月10日

Pose-Normalized Image Generation for Person Re-identification

Arxiv

11+阅读 · 2018年1月18日

相关基金

基于血脑PK-PD和结构方程模型的银杏叶提取物多组分协同抗脑缺血作用研究

国家自然科学基金

0+阅读 · 2015年12月31日

GSK-3β在造影剂致肾小管上皮细胞损伤中的作用及机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

氧化石墨烯脂质体的制备及其在化学发光生物分析中的应用

国家自然科学基金

0+阅读 · 2014年12月31日

南极表层海水低温微生物筛选及其低温适应机制探讨

国家自然科学基金

1+阅读 · 2014年12月31日

TTMB赋能干预模式对慢性肾病血液透析患者自我管理和身心健康的效果和机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

调节性B细胞在HIV-1慢性感染中的作用机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

中国东部壳斗科植物潜叶昆虫多样性的纬度梯度分布格局

国家自然科学基金

0+阅读 · 2012年12月31日

IL-17A在肺炎链球菌急性中耳炎黏膜免疫保护中的作用

国家自然科学基金

0+阅读 · 2012年12月31日

随机微分方程概周期解和遍历解

国家自然科学基金

4+阅读 · 2011年12月31日

Cystatin B缺失与Prion疾病自噬作用机制的研究

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员