评价在资源不足语言上嵌入背景背景内容的评价 (Evaluation of contextual embeddings on less-resourced languages) - 专知论文

会员服务 ·

0

ELMo · BERT · MoDELS · contrastive · Neural Networks ·

2021 年 7 月 22 日

Evaluation of contextual embeddings on less-resourced languages

翻译：评价在资源不足语言上嵌入背景背景内容的评价

Matej Ulčar,Aleš Žagar,Carlos S. Armendariz,Andraž Repar,Senja Pollak,Matthew Purver,Marko Robnik-Šikonja

from arxiv, 45 pages

The current dominance of deep neural networks in natural language processing is based on contextual embeddings such as ELMo, BERT, and BERT derivatives. Most existing work focuses on English; in contrast, we present here the first multilingual empirical comparison of two ELMo and several monolingual and multilingual BERT models using 14 tasks in nine languages. In monolingual settings, our analysis shows that monolingual BERT models generally dominate, with a few exceptions such as the dependency parsing task, where they are not competitive with ELMo models trained on large corpora. In cross-lingual settings, BERT models trained on only a few languages mostly do best, closely followed by massively multilingual BERT models.

翻译：目前,在自然语言处理中,深层神经网络的主导地位基于环境嵌入,如ELMO、BERT和BERT衍生物。大多数现有工作都以英语为重点;相比之下,我们在此介绍两个ELMO和若干单一语言和多语言BERT模式的第一次多语种经验比较,使用9种语言的14项任务。在单一语言环境中,我们的分析表明,单语BERT模式通常占主导地位,但有少数例外,如依赖性区分任务,与在大型公司方面受过培训的ELMO模式没有竞争力。在跨语言环境中,只对少数语言进行了最优秀的培训,紧随其后是大规模多语言的BERT模式。

0

相关内容

ELMo

近年来，研究人员通过文本上下文信息分析获得更好的词向量。ELMo是其中的翘楚，在多个任务、多个数据集上都有显著的提升。所以，它是目前最好用的词向量，the-state-of-the-art的方法。这篇文章发表在2018年的NAACL上，outstanding paper award。下面就简单介绍一下这个“神秘”的词向量模型。

对比学习简述

专知会员服务

90+阅读 · 2021年6月29日

神经网络与形式语言综述，12页pdf，A Survey of Neural Networks and Formal Languages

神经网络与形式语言综述，12页pdf，A Survey of Neural Networks and Formal Languages

专知会员服务

21+阅读 · 2020年6月4日

【牛津大学-DeepMind 】上下文嵌入综述，A Survey on Contextual Embeddings

【牛津大学-DeepMind 】上下文嵌入综述，A Survey on Contextual Embeddings

专知会员服务

42+阅读 · 2020年3月17日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

【NAACL 2019 workshop】优化和评估神经语言生成方法 Methods for Optimizing and Evaluating Neural Language Generation，卡内基梅隆大学| Graham Neubig，纽约大学| He He

【NAACL 2019 workshop】优化和评估神经语言生成方法 Methods for Optimizing and Evaluating Neural Language Generation，卡内基梅隆大学| Graham Neubig，纽约大学| He He

专知会员服务

4+阅读 · 2019年12月5日

【NLP模型的跨语言/跨领域迁移】《Transferring NLP models across languages and domains》

【NLP模型的跨语言/跨领域迁移】《Transferring NLP models across languages and domains》

专知会员服务

43+阅读 · 2019年11月25日

【SIGIR 2019 Tutorials】有效的网络搜索在线评估（Effective Online Evaluation for Web Search）

【SIGIR 2019 Tutorials】有效的网络搜索在线评估（Effective Online Evaluation for Web Search）

专知会员服务

4+阅读 · 2019年11月17日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

【NAACL 2019 workshop】自然法律语言处理 Workshop on Natural Legal Language Processing，鲁汶大学|Arthur Dyevre

【NAACL 2019 workshop】自然法律语言处理 Workshop on Natural Legal Language Processing，鲁汶大学|Arthur Dyevre

专知会员服务

4+阅读 · 2019年6月7日

BERT/Transformer/迁移学习NLP资源大列表

BERT/Transformer/迁移学习NLP资源大列表

专知

19+阅读 · 2019年6月9日

BERT/注意力机制/Transformer/迁移学习NLP资源大列表：awesome-bert-nlp

BERT/注意力机制/Transformer/迁移学习NLP资源大列表：awesome-bert-nlp

AINLP

40+阅读 · 2019年6月9日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

上百种预训练中文词向量：Chinese-Word-Vectors

上百种预训练中文词向量：Chinese-Word-Vectors

AINLP

23+阅读 · 2019年2月26日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

推荐｜深度强化学习聊天机器人（附论文）！

推荐｜深度强化学习聊天机器人（附论文）！

全球人工智能

4+阅读 · 2018年1月30日

论文笔记 | How NOT To Evaluate Your Dialogue System

论文笔记 | How NOT To Evaluate Your Dialogue System

科技创新与创业

13+阅读 · 2017年12月23日

【推荐】YOLO实时目标检测(6fps)

【推荐】YOLO实时目标检测(6fps)

机器学习研究会

20+阅读 · 2017年11月5日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

Controlled Evaluation of Grammatical Knowledge in Mandarin Chinese Language Models

Arxiv

1+阅读 · 2021年9月22日

Role of Language Relatedness in Multilingual Fine-tuning of Language Models: A Case Study in Indo-Aryan Languages

Arxiv

0+阅读 · 2021年9月22日

How Familiar Does That Sound? Cross-Lingual Representational Similarity Analysis of Acoustic Word Embeddings

Arxiv

0+阅读 · 2021年9月21日

Dissecting Contextual Word Embeddings: Architecture and Representation

Dissecting Contextual Word Embeddings: Architecture and Representation

Arxiv

22+阅读 · 2018年8月27日

Why Self-Attention? A Targeted Evaluation of Neural Machine Translation Architectures

Arxiv

3+阅读 · 2018年8月27日

Multi-task Learning for Universal Sentence Embeddings: A Thorough Evaluation using Transfer and Auxiliary Tasks

Multi-task Learning for Universal Sentence Embeddings: A Thorough Evaluation using Transfer and Auxiliary Tasks

Arxiv

3+阅读 · 2018年8月16日

Multi-Task Neural Models for Translating Between Styles Within and Across Languages

Arxiv

4+阅读 · 2018年6月12日

Lessons from the Bible on Modern Topics: Low-Resource Multilingual Topic Model Evaluation

Arxiv

4+阅读 · 2018年4月26日

A Comparison of Word Embeddings for the Biomedical Natural Language Processing

Arxiv

3+阅读 · 2018年2月1日

Evaluating Layers of Representation in Neural Machine Translation on Part-of-Speech and Semantic Tagging Tasks

Arxiv

3+阅读 · 2018年1月23日

VIP会员

文章信息

相关主题

Neural Networks

相关VIP内容

对比学习简述

专知会员服务

90+阅读 · 2021年6月29日

神经网络与形式语言综述，12页pdf，A Survey of Neural Networks and Formal Languages

神经网络与形式语言综述，12页pdf，A Survey of Neural Networks and Formal Languages

专知会员服务

21+阅读 · 2020年6月4日

【牛津大学-DeepMind 】上下文嵌入综述，A Survey on Contextual Embeddings

【牛津大学-DeepMind 】上下文嵌入综述，A Survey on Contextual Embeddings

专知会员服务

42+阅读 · 2020年3月17日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

【NAACL 2019 workshop】优化和评估神经语言生成方法 Methods for Optimizing and Evaluating Neural Language Generation，卡内基梅隆大学| Graham Neubig，纽约大学| He He

【NAACL 2019 workshop】优化和评估神经语言生成方法 Methods for Optimizing and Evaluating Neural Language Generation，卡内基梅隆大学| Graham Neubig，纽约大学| He He

专知会员服务

4+阅读 · 2019年12月5日

【NLP模型的跨语言/跨领域迁移】《Transferring NLP models across languages and domains》

【NLP模型的跨语言/跨领域迁移】《Transferring NLP models across languages and domains》

专知会员服务

43+阅读 · 2019年11月25日

【SIGIR 2019 Tutorials】有效的网络搜索在线评估（Effective Online Evaluation for Web Search）

【SIGIR 2019 Tutorials】有效的网络搜索在线评估（Effective Online Evaluation for Web Search）

专知会员服务

4+阅读 · 2019年11月17日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

【NAACL 2019 workshop】自然法律语言处理 Workshop on Natural Legal Language Processing，鲁汶大学|Arthur Dyevre

【NAACL 2019 workshop】自然法律语言处理 Workshop on Natural Legal Language Processing，鲁汶大学|Arthur Dyevre

专知会员服务

4+阅读 · 2019年6月7日

热门VIP内容

开通专知VIP会员享更多权益服务

【博士论文】扩展可扩展会话推荐的边界

别想太多：高效 R1 风格大型推理模型综述

【ACMMM2025】EvoVLMA: 进化式视觉-语言模型自适应

智能体网络：用AI智能体编织下一代网络

相关资讯

BERT/Transformer/迁移学习NLP资源大列表

BERT/Transformer/迁移学习NLP资源大列表

专知

19+阅读 · 2019年6月9日

BERT/注意力机制/Transformer/迁移学习NLP资源大列表：awesome-bert-nlp

BERT/注意力机制/Transformer/迁移学习NLP资源大列表：awesome-bert-nlp

AINLP

40+阅读 · 2019年6月9日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

上百种预训练中文词向量：Chinese-Word-Vectors

上百种预训练中文词向量：Chinese-Word-Vectors

AINLP

23+阅读 · 2019年2月26日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

推荐｜深度强化学习聊天机器人（附论文）！

推荐｜深度强化学习聊天机器人（附论文）！

全球人工智能

4+阅读 · 2018年1月30日

论文笔记 | How NOT To Evaluate Your Dialogue System

论文笔记 | How NOT To Evaluate Your Dialogue System

科技创新与创业

13+阅读 · 2017年12月23日

【推荐】YOLO实时目标检测(6fps)

【推荐】YOLO实时目标检测(6fps)

机器学习研究会

20+阅读 · 2017年11月5日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

相关论文

Controlled Evaluation of Grammatical Knowledge in Mandarin Chinese Language Models

Arxiv

1+阅读 · 2021年9月22日

Role of Language Relatedness in Multilingual Fine-tuning of Language Models: A Case Study in Indo-Aryan Languages

Arxiv

0+阅读 · 2021年9月22日

How Familiar Does That Sound? Cross-Lingual Representational Similarity Analysis of Acoustic Word Embeddings

Arxiv

0+阅读 · 2021年9月21日

Dissecting Contextual Word Embeddings: Architecture and Representation

Dissecting Contextual Word Embeddings: Architecture and Representation

Arxiv

22+阅读 · 2018年8月27日

Why Self-Attention? A Targeted Evaluation of Neural Machine Translation Architectures

Arxiv

3+阅读 · 2018年8月27日

Multi-task Learning for Universal Sentence Embeddings: A Thorough Evaluation using Transfer and Auxiliary Tasks

Multi-task Learning for Universal Sentence Embeddings: A Thorough Evaluation using Transfer and Auxiliary Tasks

Arxiv

3+阅读 · 2018年8月16日

Multi-Task Neural Models for Translating Between Styles Within and Across Languages

Arxiv

4+阅读 · 2018年6月12日

Lessons from the Bible on Modern Topics: Low-Resource Multilingual Topic Model Evaluation

Arxiv

4+阅读 · 2018年4月26日

A Comparison of Word Embeddings for the Biomedical Natural Language Processing

Arxiv

3+阅读 · 2018年2月1日

Evaluating Layers of Representation in Neural Machine Translation on Part-of-Speech and Semantic Tagging Tasks

Arxiv

3+阅读 · 2018年1月23日

微信扫码咨询专知VIP会员