In this paper, we describe our end-to-end multilingual speech translation system submitted to the IWSLT 2021 evaluation campaign for the Multilingual Speech Translation shared task. Our system is built by leveraging transfer learning across modalities, tasks, and languages. First, we leverage general-purpose multilingual modules pretrained on large amounts of unlabelled and labelled data. We then enable knowledge transfer from the text task to the speech task by training the two tasks jointly. Finally, our multilingual model is finetuned on speech translation task-specific data to achieve the best translation results. Experimental results show that our system outperforms the reported systems, including both end-to-end and cascaded approaches, by a large margin. In some translation directions, our speech translation results evaluated on the public Multilingual TEDx test set are even comparable to those of a strong text-to-text translation system that uses the oracle speech transcripts as input.