改进机器翻译的功能性和可适应性 (Improving Both Domain Robustness and Domain Adaptability in Machine Translation) - 专知论文

会员服务 ·

0

稳健性 · Machine Translation · MoDELS · HTTPS · Integration ·

2022 年 10 月 4 日

Improving Both Domain Robustness and Domain Adaptability in Machine Translation

翻译：改进机器翻译的功能性和可适应性

Wen Lai,Jindřich Libovický,Alexander Fraser

from arxiv, Accepted to COLING 2022

We consider two problems of NMT domain adaptation using meta-learning. First, we want to reach domain robustness, i.e., we want to reach high quality on both domains seen in the training data and unseen domains. Second, we want our systems to be adaptive, i.e., making it possible to finetune systems with just hundreds of in-domain parallel sentences. We study the domain adaptability of meta-learning when improving the domain robustness of the model. In this paper, we propose a novel approach, RMLNMT (Robust Meta-Learning Framework for Neural Machine Translation Domain Adaptation), which improves the robustness of existing meta-learning models. More specifically, we show how to use a domain classifier in curriculum learning and we integrate the word-level domain mixing model into the meta-learning framework with a balanced sampling strategy. Experiments on English$\rightarrow$German and English$\rightarrow$Chinese translation show that RMLNMT improves in terms of both domain robustness and domain adaptability in seen and unseen domains. Our source code is available at https://github.com/lavine-lmu/RMLNMT.

翻译：我们考虑的是利用元学习来调整NMT领域适应的两个问题。首先,我们想达到领域稳健度,即我们希望在培训数据和无形领域看到的两个领域都达到高质量。其次,我们希望我们的系统具有适应性,即有可能微调系统,仅用数百个在部内平行的句子。我们在改进模型的域稳健度时研究元学习的域适度。在本文中,我们提出一种新颖的方法,即RMLNMT(神经机器转换 Domain适应的Robust Met-Learning框架),它能提高现有元学习模式的稳健性。更具体地说,我们展示了如何在课程学习中使用域分类器,并将字级域模型与平衡的抽样战略结合到元学习框架。关于英语和德语的实验显示,RMLNMMT在可见和看不见域域域域的域稳健性和域适应性两方面都有改进。我们的源代码可在https://github.com/lavine-lam/RMMTNMTN中查阅。

0

相关内容

稳健性

《AI中毒攻击》34页slides

《AI中毒攻击》34页slides

专知会员服务

26+阅读 · 2022年10月17日

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

专知会员服务

135+阅读 · 2021年6月16日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

【深度学习架构、模型和技巧集合(TensorFlow/PyTorch)】’Deep Learning Models - A collection of various deep learning architectures, models, and tips'

【深度学习架构、模型和技巧集合(TensorFlow/PyTorch)】’Deep Learning Models - A collection of various deep learning architectures, models, and tips'

专知会员服务

58+阅读 · 2020年1月25日

近期必读的9篇CVPR 2019【域自适应（Domain Adaptation）】相关论文和代码

近期必读的9篇CVPR 2019【域自适应（Domain Adaptation）】相关论文和代码

专知会员服务

62+阅读 · 2020年1月10日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

【ICIG2021】Latest News & Announcements of the Industry Talk1

【ICIG2021】Latest News & Announcements of the Industry Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年7月28日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Aβ外周清除功能在阿尔茨海默病发生中的作用和机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

人APOE4基因影响AD早期神经元内Aβ42沉积诱导小胶质细胞活化及神经炎症级联反应的分子机制

国家自然科学基金

0+阅读 · 2014年12月31日

NF-κB信号通路调控溶酶体相关4次跨膜蛋白质B (LAPTM4B)促人肝细胞癌增殖作用的研究

国家自然科学基金

0+阅读 · 2013年12月31日

辅助T细胞分化调节在针刺治疗神经病理痛中的机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

Diversin介导非小细胞肺癌长春瑞滨耐药的分子机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

低氧对大鼠EIMD肌纤维膜损伤的影响机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

15-kDa硒蛋白在内质网应激（ERS）和阿尔茨海默病(AD)中的功能研究

国家自然科学基金

0+阅读 · 2012年12月31日

TLR4活化TAP63a诱导细胞凋亡的分子机制

国家自然科学基金

0+阅读 · 2012年12月31日

钙敏感性IRE1酶"门控"作用对肝癌细胞自噬生存/死亡转归的影响及药物干预研究

国家自然科学基金

0+阅读 · 2011年12月31日

RGC-32参与TGF-β#35825;导肾小管上皮向间充质细胞转化的分子调控机制

国家自然科学基金

0+阅读 · 2008年12月31日

Multi-level Domain Adaptation for Lane Detection

Arxiv

0+阅读 · 2022年11月9日

What Knowledge Is Needed? Towards Explainable Memory for kNN-MT Domain Adaptation

Arxiv

0+阅读 · 2022年11月8日

Unsupervised Domain Adaptation for Sparse Retrieval by Filling Vocabulary and Word Frequency Gaps

Arxiv

0+阅读 · 2022年11月8日

On the Domain Adaptation and Generalization of Pretrained Language Models: A Survey

Arxiv

0+阅读 · 2022年11月6日

MiddleGAN: Generate Domain Agnostic Samples for Unsupervised Domain Adaptation

Arxiv

0+阅读 · 2022年11月6日

Non-Parametric Domain Adaptation for End-to-End Speech Translation

Arxiv

0+阅读 · 2022年11月4日

Active Learning for Domain Adaptation: An Energy-based Approach

Arxiv

13+阅读 · 2021年12月2日

Cross-Domain Adaptive Clustering for Semi-Supervised Domain Adaptation

Cross-Domain Adaptive Clustering for Semi-Supervised Domain Adaptation

Arxiv

19+阅读 · 2021年4月19日

Adaptive Methods for Real-World Domain Generalization

Arxiv

13+阅读 · 2021年3月29日

A Survey of Domain Adaptation for Neural Machine Translation

Arxiv

17+阅读 · 2018年6月1日

VIP会员

文章信息

相关主题

Machine Translation

相关VIP内容

《AI中毒攻击》34页slides

《AI中毒攻击》34页slides

专知会员服务

26+阅读 · 2022年10月17日

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

专知会员服务

135+阅读 · 2021年6月16日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

【深度学习架构、模型和技巧集合(TensorFlow/PyTorch)】’Deep Learning Models - A collection of various deep learning architectures, models, and tips'

【深度学习架构、模型和技巧集合(TensorFlow/PyTorch)】’Deep Learning Models - A collection of various deep learning architectures, models, and tips'

专知会员服务

58+阅读 · 2020年1月25日

近期必读的9篇CVPR 2019【域自适应（Domain Adaptation）】相关论文和代码

近期必读的9篇CVPR 2019【域自适应（Domain Adaptation）】相关论文和代码

专知会员服务

62+阅读 · 2020年1月10日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【CMU博士论文】数据驱动决策中的激励、信息与不确定性

DGP双粒度提示框架：图增强大模型助力欺诈检测

【ICCV2025】ESSENTIAL：用于视频类增量学习的情景记忆与语义记忆整合

唯快不破：大型语言模型高效架构综述

相关资讯

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

【ICIG2021】Latest News & Announcements of the Industry Talk1

【ICIG2021】Latest News & Announcements of the Industry Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年7月28日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

相关论文

Multi-level Domain Adaptation for Lane Detection

Arxiv

0+阅读 · 2022年11月9日

What Knowledge Is Needed? Towards Explainable Memory for kNN-MT Domain Adaptation

Arxiv

0+阅读 · 2022年11月8日

Unsupervised Domain Adaptation for Sparse Retrieval by Filling Vocabulary and Word Frequency Gaps

Arxiv

0+阅读 · 2022年11月8日

On the Domain Adaptation and Generalization of Pretrained Language Models: A Survey

Arxiv

0+阅读 · 2022年11月6日

MiddleGAN: Generate Domain Agnostic Samples for Unsupervised Domain Adaptation

Arxiv

0+阅读 · 2022年11月6日

Non-Parametric Domain Adaptation for End-to-End Speech Translation

Arxiv

0+阅读 · 2022年11月4日

Active Learning for Domain Adaptation: An Energy-based Approach

Arxiv

13+阅读 · 2021年12月2日

Cross-Domain Adaptive Clustering for Semi-Supervised Domain Adaptation

Cross-Domain Adaptive Clustering for Semi-Supervised Domain Adaptation

Arxiv

19+阅读 · 2021年4月19日

Adaptive Methods for Real-World Domain Generalization

Arxiv

13+阅读 · 2021年3月29日

A Survey of Domain Adaptation for Neural Machine Translation

Arxiv

17+阅读 · 2018年6月1日

相关基金

Aβ外周清除功能在阿尔茨海默病发生中的作用和机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

人APOE4基因影响AD早期神经元内Aβ42沉积诱导小胶质细胞活化及神经炎症级联反应的分子机制

国家自然科学基金

0+阅读 · 2014年12月31日

NF-κB信号通路调控溶酶体相关4次跨膜蛋白质B (LAPTM4B)促人肝细胞癌增殖作用的研究

国家自然科学基金

0+阅读 · 2013年12月31日

辅助T细胞分化调节在针刺治疗神经病理痛中的机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

Diversin介导非小细胞肺癌长春瑞滨耐药的分子机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

低氧对大鼠EIMD肌纤维膜损伤的影响机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

15-kDa硒蛋白在内质网应激（ERS）和阿尔茨海默病(AD)中的功能研究

国家自然科学基金

0+阅读 · 2012年12月31日

TLR4活化TAP63a诱导细胞凋亡的分子机制

国家自然科学基金

0+阅读 · 2012年12月31日

钙敏感性IRE1酶"门控"作用对肝癌细胞自噬生存/死亡转归的影响及药物干预研究

国家自然科学基金

0+阅读 · 2011年12月31日

RGC-32参与TGF-β#35825;导肾小管上皮向间充质细胞转化的分子调控机制

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员