多重利益能否是非自动机械翻译? (Can Multilinguality benefit Non-autoregressive Machine Translation?)

Non-autoregressive (NAR) machine translation has recently achieved significant improvements, and now outperforms autoregressive (AR) models on some benchmarks, providing an efficient alternative to AR inference. However, while AR translation is often implemented using multilingual models that benefit from transfer between languages and from improved serving efficiency, multilingual NAR models remain relatively unexplored. Taking Connectionist Temporal Classification (CTC) as an example NAR model and Imputer as a semi-NAR model, we present a comprehensive empirical study of multilingual NAR. We test its capabilities with respect to positive transfer between related languages and negative transfer under capacity constraints. As NAR models require distilled training sets, we carefully study the impact of bilingual versus multilingual teachers. Finally, we fit a scaling law for multilingual NAR, which quantifies its performance relative to the AR model as model scale increases.

翻译：最近,非自发性机器翻译(NAR)取得了显著改进,目前已在某些基准上优于自动递减模式,为AR的推论提供了有效的替代方法;然而,虽然AR的翻译往往使用多语种模型,这些模型受益于语言之间的转让和服务的提高,多语言的NAR模型仍然相对没有被探索;以NAR模型为例,将Interute作为半NAR模型,我们对多语言NAR进行了全面的经验研究;我们测试了它在相关语言之间的积极转让和能力限制下的负转移方面的能力;由于NAR模型需要精练的培训,我们仔细研究了双语教师与多语言教师的影响;最后,我们为多语言的ARAR设计了规模法,在模型增加时将其业绩与AR模型相比进行量化。

相关内容

MoDELS

关注 43

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

多标签学习的新趋势（2020 Survey）

专知会员服务

44+阅读 · 2020年12月6日

多语言神经机器翻译综述论文，34页pdf，A Comprehensive Survey of Multilingual Neural Machine Translation

专知会员服务

19+阅读 · 2020年4月25日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

【论文】多语言神经机器翻译综述（A Comprehensive Survey of Multilingual Neural Machine Translation）

专知会员服务

20+阅读 · 2020年1月7日