Document-level neural machine translation (DocNMT) achieves coherent translations by incorporating cross-sentence context. However, for most language pairs parallel documents are scarce, although parallel sentences are readily available. In this paper, we study whether and how contextual modeling in DocNMT is transferable via multilingual modeling. We focus on the scenario of zero-shot transfer from teacher languages with document-level data to student languages with no documents but sentence-level data, and, for the first time, treat document-level translation as a transfer learning problem. Using simple concatenation-based DocNMT, we explore the effect of three factors on the transfer: the number of teacher languages with document-level data, the balance between document- and sentence-level data at training, and the data condition of the parallel documents (genuine vs. back-translated). Our experiments on Europarl-7 and IWSLT-10 show the feasibility of multilingual transfer for DocNMT, particularly on document-specific metrics. We observe that more teacher languages and an adequate data balance both contribute to better transfer quality. Surprisingly, the transfer is less sensitive to the data condition: multilingual DocNMT delivers decent performance with either back-translated or genuine document pairs.
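As a point of reference, concatenation-based DocNMT typically turns a parallel document into training examples by joining consecutive sentences with a boundary token on both the source and target sides. The sketch below illustrates this preprocessing step; the separator token `<sep>`, the function name, and the fixed window size are illustrative assumptions, not the paper's exact setup.

```python
# Minimal sketch (assumed preprocessing, not the authors' code):
# build concatenation-based DocNMT examples by joining consecutive
# source/target sentences of a parallel document with a separator.

from typing import List, Tuple

SEP = " <sep> "  # hypothetical sentence-boundary marker


def make_doc_examples(
    src_sents: List[str],
    tgt_sents: List[str],
    window: int = 3,  # assumed fixed context-window size
) -> List[Tuple[str, str]]:
    """Slide a fixed-size window over a parallel document and emit
    (source, target) pairs whose sentences are concatenated with SEP."""
    assert len(src_sents) == len(tgt_sents)
    examples = []
    for i in range(0, len(src_sents), window):
        src = SEP.join(src_sents[i:i + window])
        tgt = SEP.join(tgt_sents[i:i + window])
        examples.append((src, tgt))
    return examples


if __name__ == "__main__":
    # A 4-sentence document with window=3 yields two examples:
    # one with three concatenated sentences, one with the remainder.
    src = ["Er kam an.", "Es regnete.", "Er blieb.", "Dann ging er."]
    tgt = ["He arrived.", "It rained.", "He stayed.", "Then he left."]
    for s, t in make_doc_examples(src, tgt):
        print(s, "=>", t)
```

Under this framing, a student language with only sentence-level data simply contributes single-sentence examples (window of one), while teacher languages contribute concatenated document-level examples, so both can be mixed in one multilingual training corpus.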