Finding Sparse Structures for Domain Specific Neural Machine Translation

Neural machine translation often adopts fine-tuning to adapt to specific domains. However, unrestricted fine-tuning can easily degrade performance on the general domain and over-fit to the target domain. To mitigate the issue, we propose Prune-Tune, a novel domain adaptation method via gradual pruning. It learns tiny domain-specific sub-networks during fine-tuning on new domains. Prune-Tune alleviates the over-fitting and degradation problems without model modification. Furthermore, Prune-Tune is able to sequentially learn a single network with multiple disjoint domain-specific sub-networks for multiple domains. Experimental results show that Prune-Tune outperforms several strong competitors on the target-domain test set without sacrificing quality on the general domain, in both single-domain and multi-domain settings. The source code and data are available at https://github.com/ohlionel/Prune-Tune.
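
The abstract does not spell out the procedure, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes magnitude-based pruning, a flat list of weights standing in for a model, and hand-written gradients; every function name and number here is hypothetical. It shows three things: sparsity is raised gradually to find a general-domain sub-network, fine-tuning touches only the freed slots, and each new domain claims a sub-network disjoint from the earlier ones.

```python
# Illustrative sketch only: magnitude pruning, the toy weights, and all names
# below are assumptions, not taken from the Prune-Tune code base.

def top_magnitude(weights, candidates, n_keep):
    """Indices of the n_keep largest-magnitude weights among `candidates`."""
    ranked = sorted(candidates, key=lambda i: abs(weights[i]), reverse=True)
    return set(ranked[:n_keep])


def gradual_prune(weights, final_sparsity, steps):
    """Raise sparsity linearly over `steps` rounds, zeroing the smallest weights.

    Returns the pruned weights and the set of surviving (general-domain) indices.
    """
    weights = list(weights)
    kept = set(range(len(weights)))
    for step in range(1, steps + 1):
        sparsity = final_sparsity * step / steps
        n_keep = round((1.0 - sparsity) * len(weights))
        kept = top_magnitude(weights, kept, n_keep)
        weights = [w if i in kept else 0.0 for i, w in enumerate(weights)]
        # A real pipeline would retrain the surviving weights here (assumption).
    return weights, kept


def masked_sgd_step(weights, grads, trainable, lr=0.1):
    """Update only the indices in `trainable`; all other weights stay frozen."""
    return [w - lr * g if i in trainable else w
            for i, (w, g) in enumerate(zip(weights, grads))]


def adapt_domain(weights, grads, free, n_domain, lr=0.1):
    """Fine-tune the free slots, then keep the n_domain largest as the domain sub-network."""
    tuned = masked_sgd_step(weights, grads, free, lr)
    domain = top_magnitude(tuned, free, n_domain)
    # Free slots that were not selected are reset so a later domain can reuse them.
    tuned = [w if (i not in free or i in domain) else 0.0
             for i, w in enumerate(tuned)]
    return tuned, domain


def subnetwork(weights, active):
    """View of the model with only the `active` indices switched on."""
    return [w if i in active else 0.0 for i, w in enumerate(weights)]


# Toy "general-domain model" with ten weights.
general = [0.9, -0.05, 0.4, 0.01, -0.7, 0.02, 0.3, -0.03, 0.6, 0.04]
pruned, general_mask = gradual_prune(general, final_sparsity=0.5, steps=5)
free = set(range(len(pruned))) - general_mask

# Domain A trains only on the freed slots and claims two of them.
grads_a = [5.0, -3.0, 5.0, 1.0, 5.0, -2.0, 5.0, 0.5, 5.0, 0.1]
model_a, domain_a = adapt_domain(pruned, grads_a, free, n_domain=2)

# Domain B sees only the slots that neither the general model nor domain A uses.
free_b = free - domain_a
grads_b = [5.0, 5.0, 5.0, -4.0, 5.0, 5.0, 5.0, 1.0, 5.0, -2.0]
model_b, domain_b = adapt_domain(model_a, grads_b, free_b, n_domain=2)

# The general-domain view is the same before and after both adaptations.
print(subnetwork(model_b, general_mask) == pruned)
print(sorted(general_mask), sorted(domain_a), sorted(domain_b))
```

Because every update is masked, the general-domain weights are bit-for-bit unchanged after both adaptations, which is the property the abstract describes as adapting without sacrificing general-domain quality. In this toy setup, translating for a given domain would use `subnetwork(model_b, general_mask | domain_a)` or the equivalent for domain B; whether the real system composes masks this way is an assumption.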

