用自己的声音讲外语:跨语言神经规范语言建模</s> (Speak Foreign Languages with Your Own Voice: Cross-Lingual Neural Codec Language Modeling) - 专知论文

会员服务 ·

0

语言模型化 · MoDELS · 语音合成 · HTTPS · 回合 ·

2023 年 3 月 7 日

Speak Foreign Languages with Your Own Voice: Cross-Lingual Neural Codec Language Modeling

翻译：用自己的声音讲外语:跨语言神经规范语言建模

Ziqiang Zhang,Long Zhou,Chengyi Wang,Sanyuan Chen,Yu Wu,Shujie Liu,Zhuo Chen,Yanqing Liu,Huaming Wang,Jinyu Li,Lei He,Sheng Zhao,Furu Wei

from arxiv, We encourage readers to listen to the audio samples on our demo page: \url{https://aka.ms/vallex}

We propose a cross-lingual neural codec language model, VALL-E X, for cross-lingual speech synthesis. Specifically, we extend VALL-E and train a multi-lingual conditional codec language model to predict the acoustic token sequences of the target language speech by using both the source language speech and the target language text as prompts. VALL-E X inherits strong in-context learning capabilities and can be applied for zero-shot cross-lingual text-to-speech synthesis and zero-shot speech-to-speech translation tasks. Experimental results show that it can generate high-quality speech in the target language via just one speech utterance in the source language as a prompt while preserving the unseen speaker's voice, emotion, and acoustic environment. Moreover, VALL-E X effectively alleviates the foreign accent problems, which can be controlled by a language ID. Audio samples are available at \url{https://aka.ms/vallex}.

翻译：我们提出跨语言神经规范语言模型,VALL-E X,用于跨语言语言合成。具体地说,我们推广VALL-E,并培训多语言有条件的多语言代码语言模型,通过使用源语言语言讲话和目标语言文本作为提示,预测目标语言语言语言语言语言语言的声象序列。VALL-E X继承了很强的文字学习能力,可用于零发跨语言文本对语音合成和零发语音对语音翻译任务。实验结果显示,它可以通过源语言只用一种语言发出高质量语言的高质量演讲,作为快速的发音,同时保护隐匿语言的声音、情感和声响音环境。此外,VALLLE-E X有效地缓解了外国口音问题,这些口音可以通过语言识别来控制。声音样本可在以下https://aka.ms/vallex}查阅。</s>

0

相关内容

语言模型化

语言模型化

NeurlPS 2022 | 自然语言处理相关论文分类整理

NeurlPS 2022 | 自然语言处理相关论文分类整理

专知会员服务

51+阅读 · 2022年10月2日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

BERT/Transformer/迁移学习NLP资源大列表

BERT/Transformer/迁移学习NLP资源大列表

专知

19+阅读 · 2019年6月9日

BERT/注意力机制/Transformer/迁移学习NLP资源大列表：awesome-bert-nlp

BERT/注意力机制/Transformer/迁移学习NLP资源大列表：awesome-bert-nlp

AINLP

40+阅读 · 2019年6月9日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

利用核技术分析并构建金属标记富勒烯多功能纳米材料

国家自然科学基金

0+阅读 · 2013年12月31日

跨语言社会舆情分析基础理论与关键技术研究

国家自然科学基金

1+阅读 · 2013年12月31日

单细胞遗传分析仪研制

国家自然科学基金

0+阅读 · 2012年12月31日

微流控芯片—毛细管电泳—微液滴喷射雾化器等离子体质谱联机进行单细胞内金属形态分析的研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于HPLC-MS质控下的化瘀消癥杀胚中药对人输卵管妊娠滋养细胞影响的研究

国家自然科学基金

0+阅读 · 2009年12月31日

Model and Data Transfer for Cross-Lingual Sequence Labelling in Zero-Resource Settings

Arxiv

0+阅读 · 2023年4月27日

CROP: Towards Distributional-Shift Robust Reinforcement Learning using Compact Reshaped Observation Processing

Arxiv

0+阅读 · 2023年4月26日

Knowledge Graph Embedding: A Survey from the Perspective of Representation Spaces

Arxiv

18+阅读 · 2022年11月7日

Learning to Respond with Stickers: A Framework of Unifying Multi-Modality in Multi-Turn Dialog

Learning to Respond with Stickers: A Framework of Unifying Multi-Modality in Multi-Turn Dialog

Arxiv

14+阅读 · 2020年3月10日

One for All: Neural Joint Modeling of Entities and Events

Arxiv

11+阅读 · 2018年12月1日

VIP会员

文章信息

相关主题

语言模型化

相关VIP内容

NeurlPS 2022 | 自然语言处理相关论文分类整理

NeurlPS 2022 | 自然语言处理相关论文分类整理

专知会员服务

51+阅读 · 2022年10月2日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

热门VIP内容

开通专知VIP会员享更多权益服务

从代码基础模型到智能体与应用：代码智能的全面综述与实践指南

《北约认知战概念报告》

【MIT博士论文】高效的视觉合成生成模型

美海军放弃星座级转而采用国家安全巡逻舰设计

相关资讯

BERT/Transformer/迁移学习NLP资源大列表

BERT/Transformer/迁移学习NLP资源大列表

专知

19+阅读 · 2019年6月9日

BERT/注意力机制/Transformer/迁移学习NLP资源大列表：awesome-bert-nlp

BERT/注意力机制/Transformer/迁移学习NLP资源大列表：awesome-bert-nlp

AINLP

40+阅读 · 2019年6月9日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

相关论文

Model and Data Transfer for Cross-Lingual Sequence Labelling in Zero-Resource Settings

Arxiv

0+阅读 · 2023年4月27日

CROP: Towards Distributional-Shift Robust Reinforcement Learning using Compact Reshaped Observation Processing

Arxiv

0+阅读 · 2023年4月26日

Knowledge Graph Embedding: A Survey from the Perspective of Representation Spaces

Arxiv

18+阅读 · 2022年11月7日

Learning to Respond with Stickers: A Framework of Unifying Multi-Modality in Multi-Turn Dialog

Learning to Respond with Stickers: A Framework of Unifying Multi-Modality in Multi-Turn Dialog

Arxiv

14+阅读 · 2020年3月10日

One for All: Neural Joint Modeling of Entities and Events

Arxiv

11+阅读 · 2018年12月1日

相关基金

利用核技术分析并构建金属标记富勒烯多功能纳米材料

国家自然科学基金

0+阅读 · 2013年12月31日

跨语言社会舆情分析基础理论与关键技术研究

国家自然科学基金

1+阅读 · 2013年12月31日

单细胞遗传分析仪研制

国家自然科学基金

0+阅读 · 2012年12月31日

微流控芯片—毛细管电泳—微液滴喷射雾化器等离子体质谱联机进行单细胞内金属形态分析的研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于HPLC-MS质控下的化瘀消癥杀胚中药对人输卵管妊娠滋养细胞影响的研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员