Multilingual language models have shown decent performance in multilingual and cross-lingual natural language understanding tasks. However, the power of these multilingual models in code-switching tasks has not been fully explored. In this paper, we study the effectiveness of multilingual language models to understand their capability and adaptability to the mixed-language setting by considering the inference speed, performance, and number of parameters to measure their practicality. We conduct experiments in three language pairs on named entity recognition and part-of-speech tagging and compare them with existing methods, such as using bilingual embeddings and multilingual meta-embeddings. Our findings suggest that pre-trained multilingual models do not necessarily guarantee high-quality representations on code-switching, while using meta-embeddings achieves similar results with significantly fewer parameters.
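The multilingual meta-embedding baseline mentioned above combines word vectors drawn from several monolingual embedding spaces using learned attention weights, which is why it can stay small in parameter count. Below is a minimal sketch of that idea; the class name, layer shapes, and dimensions are illustrative assumptions, not the authors' released implementation.

```python
import torch
import torch.nn as nn

class MultilingualMetaEmbedding(nn.Module):
    """Attention-based mixture of word vectors from several monolingual
    embedding spaces (illustrative sketch of the meta-embedding idea)."""

    def __init__(self, embed_dims, proj_dim):
        super().__init__()
        # One projection per source embedding space, mapping each
        # language-specific vector into a shared space.
        self.projections = nn.ModuleList(
            nn.Linear(d, proj_dim) for d in embed_dims
        )
        # Scores each projected vector; softmax over sources gives
        # per-token mixing weights.
        self.scorer = nn.Linear(proj_dim, 1)

    def forward(self, embeddings):
        # embeddings: list of tensors, each (batch, seq_len, embed_dims[i])
        projected = torch.stack(
            [proj(e) for proj, e in zip(self.projections, embeddings)], dim=2
        )  # (batch, seq_len, num_sources, proj_dim)
        weights = torch.softmax(self.scorer(projected), dim=2)
        # Weighted sum over the source-embedding axis.
        return (weights * projected).sum(dim=2)  # (batch, seq_len, proj_dim)


# Hypothetical usage: mix English and Spanish vectors (300-d each) for a
# code-switched sentence of 10 tokens.
meta = MultilingualMetaEmbedding(embed_dims=[300, 300], proj_dim=200)
en_vecs = torch.randn(1, 10, 300)
es_vecs = torch.randn(1, 10, 300)
mixed = meta([en_vecs, es_vecs])  # (1, 10, 200)
```

The resulting token representations can then be fed to a standard tagger for named entity recognition or part-of-speech tagging, in place of the much larger pre-trained multilingual encoder.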