ERNIE-M: Enhanced Multilingual Representation by Aligning Cross-lingual Semantics with Monolingual Corpora

Recent studies have demonstrated that pre-trained cross-lingual models achieve impressive performance in downstream cross-lingual tasks. This improvement comes from learning from large amounts of monolingual and parallel corpora. Although it is generally acknowledged that parallel corpora are critical for improving model performance, existing methods are often constrained by the size of the available parallel corpora, especially for low-resource languages. In this paper, we propose ERNIE-M, a new training method that encourages the model to align the representations of multiple languages using monolingual corpora, to overcome the constraint that parallel corpus size places on model performance. Our key insight is to integrate back-translation into the pre-training process. We generate pseudo-parallel sentence pairs from a monolingual corpus to enable the learning of semantic alignments between different languages, thereby enhancing the semantic modeling of cross-lingual models. Experimental results show that ERNIE-M outperforms existing cross-lingual models and delivers new state-of-the-art results in various cross-lingual downstream tasks.
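The idea of building pseudo-parallel sentence pairs from a monolingual corpus can be illustrated with a minimal sketch. This is not the ERNIE-M procedure itself: the abstract does not specify how the pseudo translations are produced, so the example assumes a hypothetical word-for-word lexicon (`TOY_EN_TO_FR`) and helper names (`toy_translate`, `make_pseudo_parallel`) as a stand-in for whatever model generates the target side.

```python
from typing import Callable, List, Tuple

# Hypothetical toy lexicon standing in for a learned translation model.
# It is an assumption for illustration only and does not come from the paper.
TOY_EN_TO_FR = {"the": "le", "cat": "chat", "sleeps": "dort"}


def toy_translate(sentence: str) -> str:
    """Word-for-word lookup; words missing from the lexicon are copied unchanged."""
    tokens = sentence.lower().split()
    return " ".join(TOY_EN_TO_FR.get(tok, tok) for tok in tokens)


def make_pseudo_parallel(
    corpus: List[str], translate: Callable[[str], str]
) -> List[Tuple[str, str]]:
    """Pair each monolingual sentence with a machine-generated pseudo translation."""
    return [(sentence, translate(sentence)) for sentence in corpus]


# A monolingual corpus yields (source, pseudo-target) pairs without any
# human-translated data.
monolingual_corpus = ["The cat sleeps", "The dog sleeps"]
pairs = make_pseudo_parallel(monolingual_corpus, toy_translate)
for source, pseudo_target in pairs:
    print(f"{source} -> {pseudo_target}")
```

In a real pre-training setup the translator would be a model rather than a lookup table, and the resulting pairs would be fed to a cross-lingual objective in the same way as genuine parallel data.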

