IWSLT 2022年IWSLT 离线共享任务YY 跨端对端语音翻译系统 (The YiTrans End-to-End Speech Translation System for IWSLT 2022 Offline Shared Task) - 专知论文

会员服务 ·

0

语音翻译 · 端到端 · MoDELS · BLEU · 未标记 ·

2022 年 6 月 12 日

The YiTrans End-to-End Speech Translation System for IWSLT 2022 Offline Shared Task

翻译：IWSLT 2022年IWSLT 离线共享任务YY 跨端对端语音翻译系统

Ziqiang Zhang,Junyi Ao,Shujie Liu,Furu Wei,Jinyu Li

from arxiv, 11 pages

This paper describes the submission of our end-to-end YiTrans speech translation system for the IWSLT 2022 offline task, which translates from English audio to German, Chinese, and Japanese. The YiTrans system is built on large-scale pre-trained encoder-decoder models. More specifically, we first design a multi-stage pre-training strategy to build a multi-modality model with a large amount of labeled and unlabeled data. We then fine-tune the corresponding components of the model for the downstream speech translation tasks. Moreover, we make various efforts to improve performance, such as data filtering, data augmentation, speech segmentation, model ensemble, and so on. Experimental results show that our YiTrans system obtains a significant improvement than the strong baseline on three translation directions, and it achieves +5.2 BLEU improvements over last year's optimal end-to-end system on tst2021 English-German. Our final submissions rank first on English-German and English-Chinese end-to-end systems in terms of the automatic evaluation metric. We make our code and models publicly available.

翻译：本文介绍我们为IWSLT 2022 离线任务提交的端到端 YiTrans 语音翻译系统,该系统由英文音频翻译为德文、中文和日文翻译。 YiTrans 系统是建立在大规模预先训练的编码编码解码模型基础上的。更具体地说,我们首先设计一个多阶段的培训前战略,以建立具有大量标签和无标签数据的多模式模式。然后我们细化下游语音翻译任务模式的相应组成部分。此外,我们还作出各种努力改进绩效,例如数据过滤、数据增强、语音分解、模块组合等。实验结果显示,我们的YiTrans系统比三个翻译方向的强大基线有显著改进,而且它比去年英语-德语的最佳端对端系统改进了5.2 BLEU。我们提交的最后文件在自动评价指标方面首先排在英语-德语和英语-中文端对端系统上。我们公布了我们的代码和模型。

0

相关内容

语音翻译

通过计算机进行不同语言之间的直接语音翻译，辅助不同语言背景的人们进行沟通已经成为世界各国研究的重点。和一般的文本翻译不同，语音翻译需要把语音识别、机器翻译和语音合成三大技术进行集成，具有很大的挑战性。

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

2019年机器学习框架回顾

2019年机器学习框架回顾

专知会员服务

36+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Plenary Talk2

【ICIG2021】Latest News & Announcements of the Plenary Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年11月2日

【ICIG2021】Latest News & Announcements of the Industry Talk2

【ICIG2021】Latest News & Announcements of the Industry Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年7月29日

【ICIG2021】Latest News & Announcements of the Industry Talk1

【ICIG2021】Latest News & Announcements of the Industry Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年7月28日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Turn-on型次磺酸荧光探针的设计与合成

国家自然科学基金

0+阅读 · 2015年12月31日

sRNA伴侣蛋白Hfq与sRNA RsmY对藤黄绿菌素合成途径转录激活子PltR表达的转录后调控机制

国家自然科学基金

0+阅读 · 2014年12月31日

PPAR β/δ基因在结直肠癌血管生成调控中的作用及分子机理

国家自然科学基金

2+阅读 · 2014年12月31日

lncRNA在左归丸、右归丸诱导BMSCs软骨分化中的表观遗传学机制

国家自然科学基金

0+阅读 · 2014年12月31日

免疫性肝损伤过程中细胞色素P450酶系下调的转录水平调节和翻译后蛋白修饰研究

国家自然科学基金

0+阅读 · 2014年12月31日

海洋天然产物Lamellarin D糖基化衍生物的合成与构效关系研究

国家自然科学基金

0+阅读 · 2013年12月31日

番茄miRNA及其靶基因的挖掘与鉴定

国家自然科学基金

0+阅读 · 2011年12月31日

Levy过程驱动的随机Fast-Diffusion方程的Harnack不等式及其应用

国家自然科学基金

0+阅读 · 2011年12月31日

孕烷X受体介导的CYP3A4基因转录调控的表观遗传分子机制的研究

国家自然科学基金

0+阅读 · 2011年12月31日

TR3相互作用新蛋白机理研究

国家自然科学基金

1+阅读 · 2008年12月31日

ESS: Learning Event-based Semantic Segmentation from Still Images

Arxiv

0+阅读 · 2022年8月2日

Learning Where To Look -- Generative NAS is Surprisingly Efficient

Arxiv

0+阅读 · 2022年8月1日

An Open Source Interactive Visual Analytics Tool for Comparative Programming Comprehension

Arxiv

0+阅读 · 2022年7月29日

IMUNet: Efficient Regression Architecture for IMU Navigation and Positioning

Arxiv

0+阅读 · 2022年7月29日

Learning Phone Recognition from Unpaired Audio and Phone Sequences Based on Generative Adversarial Network

Arxiv

0+阅读 · 2022年7月29日

Enhance Incomplete Utterance Restoration by Joint Learning Token Extraction and Text Generation

Arxiv

0+阅读 · 2022年7月29日

Multimodal Neural Machine Translation with Search Engine Based Image Retrieval

Arxiv

0+阅读 · 2022年7月26日

PEGASUS: Pre-training with Extracted Gap-sentences for Abstractive Summarization

Arxiv

17+阅读 · 2020年6月2日

End-to-End Multi-Task Learning with Attention

Arxiv

19+阅读 · 2018年3月28日

Towards Understanding and Answering Multi-Sentence Recommendation Questions on Tourism

Arxiv

15+阅读 · 2018年1月5日

VIP会员

文章信息

相关主题

相关VIP内容

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

2019年机器学习框架回顾

2019年机器学习框架回顾

专知会员服务

36+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

《乌克兰无人机产业：志愿者与政策在构建新兴无人机产业中的协同作用》最新报告

《人工智能辅助决策中的数据可视化：系统性综述》

人工智能驱动弹药制造现代化：美国陆军转型之路

《敏捷作战部署中枢纽-辐条基地选址优化研究》80页

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Plenary Talk2

【ICIG2021】Latest News & Announcements of the Plenary Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年11月2日

【ICIG2021】Latest News & Announcements of the Industry Talk2

【ICIG2021】Latest News & Announcements of the Industry Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年7月29日

【ICIG2021】Latest News & Announcements of the Industry Talk1

【ICIG2021】Latest News & Announcements of the Industry Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年7月28日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

相关论文

ESS: Learning Event-based Semantic Segmentation from Still Images

Arxiv

0+阅读 · 2022年8月2日

Learning Where To Look -- Generative NAS is Surprisingly Efficient

Arxiv

0+阅读 · 2022年8月1日

An Open Source Interactive Visual Analytics Tool for Comparative Programming Comprehension

Arxiv

0+阅读 · 2022年7月29日

IMUNet: Efficient Regression Architecture for IMU Navigation and Positioning

Arxiv

0+阅读 · 2022年7月29日

Learning Phone Recognition from Unpaired Audio and Phone Sequences Based on Generative Adversarial Network

Arxiv

0+阅读 · 2022年7月29日

Enhance Incomplete Utterance Restoration by Joint Learning Token Extraction and Text Generation

Arxiv

0+阅读 · 2022年7月29日

Multimodal Neural Machine Translation with Search Engine Based Image Retrieval

Arxiv

0+阅读 · 2022年7月26日

PEGASUS: Pre-training with Extracted Gap-sentences for Abstractive Summarization

Arxiv

17+阅读 · 2020年6月2日

End-to-End Multi-Task Learning with Attention

Arxiv

19+阅读 · 2018年3月28日

Towards Understanding and Answering Multi-Sentence Recommendation Questions on Tourism

Arxiv

15+阅读 · 2018年1月5日

相关基金

Turn-on型次磺酸荧光探针的设计与合成

国家自然科学基金

0+阅读 · 2015年12月31日

sRNA伴侣蛋白Hfq与sRNA RsmY对藤黄绿菌素合成途径转录激活子PltR表达的转录后调控机制

国家自然科学基金

0+阅读 · 2014年12月31日

PPAR β/δ基因在结直肠癌血管生成调控中的作用及分子机理

国家自然科学基金

2+阅读 · 2014年12月31日

lncRNA在左归丸、右归丸诱导BMSCs软骨分化中的表观遗传学机制

国家自然科学基金

0+阅读 · 2014年12月31日

免疫性肝损伤过程中细胞色素P450酶系下调的转录水平调节和翻译后蛋白修饰研究

国家自然科学基金

0+阅读 · 2014年12月31日

海洋天然产物Lamellarin D糖基化衍生物的合成与构效关系研究

国家自然科学基金

0+阅读 · 2013年12月31日

番茄miRNA及其靶基因的挖掘与鉴定

国家自然科学基金

0+阅读 · 2011年12月31日

Levy过程驱动的随机Fast-Diffusion方程的Harnack不等式及其应用

国家自然科学基金

0+阅读 · 2011年12月31日

孕烷X受体介导的CYP3A4基因转录调控的表观遗传分子机制的研究

国家自然科学基金

0+阅读 · 2011年12月31日

TR3相互作用新蛋白机理研究

国家自然科学基金

1+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员