妇女署 CSC: 改进零热跨语文转让,采用实体中心代码转换 (EntityCS: Improving Zero-Shot Cross-lingual Transfer with Entity-Centric Code Switching)

Accurate alignment between languages is fundamental for improving cross-lingual pre-trained language models (XLMs). Motivated by the natural phenomenon of code-switching (CS) in multilingual speakers, CS has been used as an effective data augmentation method that offers language alignment at the word- or phrase-level, in contrast to sentence-level via parallel instances. Existing approaches either use dictionaries or parallel sentences with word alignment to generate CS data by randomly switching words in a sentence. However, such methods can be suboptimal as dictionaries disregard semantics, and syntax might become invalid after random word switching. In this work, we propose EntityCS, a method that focuses on Entity-level Code-Switching to capture fine-grained cross-lingual semantics without corrupting syntax. We use Wikidata and English Wikipedia to construct an entity-centric CS corpus by switching entities to their counterparts in other languages. We further propose entity-oriented masking strategies during intermediate model training on the EntityCS corpus for improving entity prediction. Evaluation of the trained models on four entity-centric downstream tasks shows consistent improvements over the baseline with a notable increase of 10% in Fact Retrieval. We release the corpus and models to assist research on code-switching and enriching XLMs with external knowledge.

翻译：语言之间的准确一致是改进跨语言预先培训的语言模式(XLM)的根本。受多语种语言代码转换(CS)自然现象的驱使,CS被用作一种有效的数据增强方法,在单词或词组一级提供语言一致性,而通过平行的句级则不同。现有的方法要么使用词典,要么用词对齐平行句来生成CS数据,在句中随机转换词词组。然而,这种方法可能是不最优化的,因为字典无视语义,在随机换字后,通识税可能变得无效。在这项工作中,我们建议EmptyCS是一个侧重于实体一级代码转换(CS)的方法,在不腐蚀词组的同时,在单词组或词组一级提供语言校正语言校正,同时将实体改换成其他语言的对应方名。我们进一步建议,在实体CSAPS文库改进实体预测的中间模式培训中,面向实体的遮掩战略可能是无效的。我们建议,在四个实体中心下游任务中经过训练的模式评价后,实体一级代码转换法系,即显示精确的改进的外部模型,并在10级数据库中进行升级后,我们协助进行显著修正。

相关内容

MoDELS

关注 43

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

NeurlPS 2022 | 自然语言处理相关论文分类整理

专知会员服务

51+阅读 · 2022年10月2日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

不可错过! CMU CMU《高级自然语言处理》结课了，附课件与视频

专知会员服务

73+阅读 · 2021年10月4日