KoreALBERT: Pretraining a Lite BERT Model for Korean Language Understanding

A Lite BERT (ALBERT) was introduced to scale up deep bidirectional representation learning for natural language. Because pretrained ALBERT models for Korean are lacking, the best available practice has been to use the multilingual model or to fall back on another BERT-based model. In this paper, we develop and pretrain KoreALBERT, a monolingual ALBERT model specifically for Korean language understanding. We introduce a new training objective, Word Order Prediction (WOP), and use it alongside the existing masked language modeling (MLM) and sentence order prediction (SOP) objectives, with the same architecture and model parameters. Despite having significantly fewer model parameters (and thus being quicker to train), our pretrained KoreALBERT outperforms its BERT counterpart on 6 different NLU tasks. Consistent with the empirical results for English reported by Lan et al., KoreALBERT appears to improve downstream performance on tasks involving multi-sentence encoding for Korean. The pretrained KoreALBERT is publicly available to encourage research and application development for Korean NLP.
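The abstract names Word Order Prediction but does not spell out how training examples are built. The sketch below is one plausible reading, not the authors' implementation: a few token positions are permuted among themselves, and the label at each disturbed position is the index the token now sitting there originally came from. The function name `make_wop_example`, the `shuffle_ratio` parameter, and the `-100` ignore label (borrowed from the common PyTorch cross-entropy convention) are all assumptions made for illustration.

```python
import random
from typing import List, Optional, Tuple

# Hypothetical label for positions left untouched (assumed, not from the paper).
IGNORE = -100


def make_wop_example(
    tokens: List[str],
    shuffle_ratio: float = 0.15,
    seed: Optional[int] = None,
) -> Tuple[List[str], List[int]]:
    """Build one illustrative word-order-prediction example.

    Returns the partially shuffled token list and, per position, either
    the original index of the token now at that position or IGNORE.
    """
    rng = random.Random(seed)
    n = len(tokens)
    # Pick at least two positions when possible, so a reordering can occur.
    k = min(n, max(2, round(n * shuffle_ratio)))
    positions = sorted(rng.sample(range(n), k))
    sources = positions[:]
    rng.shuffle(sources)  # may occasionally leave the order unchanged

    shuffled = list(tokens)
    labels = [IGNORE] * n
    for dst, src in zip(positions, sources):
        shuffled[dst] = tokens[src]  # token from index src moves to dst
        labels[dst] = src            # target: where this token came from
    return shuffled, labels


if __name__ == "__main__":
    toks = ["나는", "오늘", "학교", "에", "갔다"]
    print(make_wop_example(toks, seed=0))
```

Under this reading, a model trained with the objective would classify each disturbed position into its original index, and the resulting loss would be added to the MLM and SOP losses. How the three losses are weighted is not stated in the abstract.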
