African languages have recently been the subject of several studies in Natural Language Processing (NLP), which has led to a significant increase in their representation in the field. However, most studies focus more on the models than on the quality of the datasets when assessing model performance in tasks such as Named Entity Recognition (NER). While this works well in most cases, it does not account for the limitations of doing NLP with low-resource languages, namely the quality and quantity of the data at our disposal. This paper analyses the performance of various models with respect to dataset quality. We evaluate several pre-trained models against the entity density per sentence of a number of African NER datasets. We hope this study will improve the way NLP research is conducted in the context of low-resource languages.
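As a rough illustration of the dataset statistic mentioned above, the sketch below shows one possible way to compute entity density per sentence from a CoNLL-style, BIO-tagged NER file (one token and tag per line, blank line between sentences). This is a minimal sketch under those format assumptions; the file path and the exact density definition (entity spans divided by sentence length) are illustrative and not necessarily those used in the study.

```python
def entity_density_per_sentence(conll_path):
    """Return one density value per sentence: number of entity spans
    (B- tags) divided by the number of tokens in that sentence.

    Assumes a CoNLL-style file: 'token ... tag' per line, blank lines
    separating sentences, BIO tagging scheme."""
    densities = []
    tokens, tags = [], []
    with open(conll_path, encoding="utf-8") as f:
        for line in f:
            line = line.strip()
            if not line:  # sentence boundary
                if tokens:
                    n_spans = sum(tag.startswith("B-") for tag in tags)
                    densities.append(n_spans / len(tokens))
                    tokens, tags = [], []
                continue
            parts = line.split()
            tokens.append(parts[0])
            tags.append(parts[-1])
    if tokens:  # handle a final sentence with no trailing blank line
        n_spans = sum(tag.startswith("B-") for tag in tags)
        densities.append(n_spans / len(tokens))
    return densities


# Hypothetical usage on one dataset split:
# densities = entity_density_per_sentence("train.txt")
# print(sum(densities) / len(densities))  # mean entity density
```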