Currently, multilingual machine translation is receiving more and more attention since it brings better performance for low resource languages (LRLs) and saves more space. However, existing multilingual machine translation models face a severe challenge: imbalance. As a result, the translation performance of different languages in multilingual translation models are quite different. We argue that this imbalance problem stems from the different learning competencies of different languages. Therefore, we focus on balancing the learning competencies of different languages and propose Competence-based Curriculum Learning for Multilingual Machine Translation, named CCL-M. Specifically, we firstly define two competencies to help schedule the high resource languages (HRLs) and the low resource languages: 1) Self-evaluated Competence, evaluating how well the language itself has been learned; and 2) HRLs-evaluated Competence, evaluating whether an LRL is ready to be learned according to HRLs' Self-evaluated Competence. Based on the above competencies, we utilize the proposed CCL-M algorithm to gradually add new languages into the training set in a curriculum learning manner. Furthermore, we propose a novel competenceaware dynamic balancing sampling strategy for better selecting training samples in multilingual training. Experimental results show that our approach has achieved a steady and significant performance gain compared to the previous state-of-the-art approach on the TED talks dataset.
翻译:目前,多语种机器翻译越来越受到越来越多的关注,因为它提高了低资源语言(LLLs)的绩效,节省了更多的空间。然而,现有的多语言机器翻译模式面临严峻的挑战:不平衡。结果,不同语言在多语种翻译模式中的翻译绩效大不相同。我们争辩说,这一不平衡问题源于不同语言的不同学习能力。因此,我们注重平衡不同语言的学习能力,并提议以CCL-M命名的多语种机器翻译基于能力的学习课程教学。具体地说,我们首先界定了两种能力,以帮助安排高资源语言(HRLs)和资源语言的时间安排:(1) 自评能力,评价语言本身的学习程度;和(2) HRLs评估能力,评价一个LLLL是否愿意根据HRLs的自评能力学习。根据上述能力,我们利用拟议的CCL-M算法,以学习方式在培训中逐步增加新的语言。此外,我们提议一种新的能力平衡抽样战略,以便在多语种培训中更好地选择培训样本。实验性结果显示,我们以往的学习方法取得了一种稳定的业绩。