多种语文机器翻译学习课程 (Competence-based Curriculum Learning for Multilingual Machine Translation)

Currently, multilingual machine translation is receiving more and more attention since it brings better performance for low resource languages (LRLs) and saves more space. However, existing multilingual machine translation models face a severe challenge: imbalance. As a result, the translation performance of different languages in multilingual translation models are quite different. We argue that this imbalance problem stems from the different learning competencies of different languages. Therefore, we focus on balancing the learning competencies of different languages and propose Competence-based Curriculum Learning for Multilingual Machine Translation, named CCL-M. Specifically, we firstly define two competencies to help schedule the high resource languages (HRLs) and the low resource languages: 1) Self-evaluated Competence, evaluating how well the language itself has been learned; and 2) HRLs-evaluated Competence, evaluating whether an LRL is ready to be learned according to HRLs' Self-evaluated Competence. Based on the above competencies, we utilize the proposed CCL-M algorithm to gradually add new languages into the training set in a curriculum learning manner. Furthermore, we propose a novel competenceaware dynamic balancing sampling strategy for better selecting training samples in multilingual training. Experimental results show that our approach has achieved a steady and significant performance gain compared to the previous state-of-the-art approach on the TED talks dataset.

翻译：目前,多语种机器翻译越来越受到越来越多的关注,因为它提高了低资源语言(LLLs)的绩效,节省了更多的空间。然而,现有的多语言机器翻译模式面临严峻的挑战:不平衡。结果,不同语言在多语种翻译模式中的翻译绩效大不相同。我们争辩说,这一不平衡问题源于不同语言的不同学习能力。因此,我们注重平衡不同语言的学习能力,并提议以CCL-M命名的多语种机器翻译基于能力的学习课程教学。具体地说,我们首先界定了两种能力,以帮助安排高资源语言(HRLs)和资源语言的时间安排:(1) 自评能力,评价语言本身的学习程度;和(2) HRLs评估能力,评价一个LLLL是否愿意根据HRLs的自评能力学习。根据上述能力,我们利用拟议的CCL-M算法,以学习方式在培训中逐步增加新的语言。此外,我们提议一种新的能力平衡抽样战略,以便在多语种培训中更好地选择培训样本。实验性结果显示,我们以往的学习方法取得了一种稳定的业绩。

相关内容

Machine Translation

关注 209

机器翻译（Machine Translation）涵盖计算语言学和语言工程的所有分支，包含多语言方面。特色论文涵盖理论，描述或计算方面的任何下列主题:双语和多语语料库的编写和使用，计算机辅助语言教学，非罗马字符集的计算含义，连接主义翻译方法，对比语言学等。官网地址：http://dblp.uni-trier.de/db/journals/mt/

【UBC】高级机器学习课程，Advanced Machine Learning

专知会员服务

26+阅读 · 2021年1月26日

专知会员服务

39+阅读 · 2020年11月3日

多语言神经机器翻译综述论文，34页pdf，A Comprehensive Survey of Multilingual Neural Machine Translation

专知会员服务

19+阅读 · 2020年4月25日

【论文】多语言神经机器翻译综述（A Comprehensive Survey of Multilingual Neural Machine Translation）

专知会员服务

20+阅读 · 2020年1月7日

【综述】文献级机器翻译研究:方法与评价（A Survey on Document-level Machine Translation: Methods and Evaluation）