Geographic Citation Gaps in NLP Research - 专知 Papers


October 26, 2022


Mukund Rungta, Janvijay Singh, Saif M. Mohammad, Diyi Yang

From arXiv; EMNLP 2022 Main Conference

In a fair world, people have equitable opportunities to obtain an education, conduct scientific research, publish, and get credit for their work, regardless of where they live. However, it is common knowledge among researchers that a vast number of papers accepted at top NLP venues come from a handful of Western countries and (lately) China, whereas very few papers from Africa and South America get published. Similar disparities are also believed to exist for paper citation counts. In the spirit of "what we do not measure, we cannot improve", this work asks a series of questions on the relationship between geographical location and publication success (acceptance in top NLP venues and citation impact). We first created a dataset of 70,000 papers from the ACL Anthology, extracted their meta-information, and generated their citation network. We then show that not only are there substantial geographical disparities in paper acceptance and citation, but also that these disparities persist even when controlling for a number of variables such as venue of publication and sub-field of NLP. Further, despite some steps taken by the NLP community to improve geographical diversity, we show that the disparity in publication metrics across locations has continued to increase since the early 2000s. We release our code and dataset here: https://github.com/iamjanvijay/acl-cite-net
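To make the idea of measuring citation disparity over a citation network concrete, here is a minimal sketch. It is not the authors' released code: the paper IDs, country labels, citation edges, and the function name `mean_citations_by_country` are all invented for illustration, and the choice of a single country label per paper is an assumption. It only shows how in-citation counts from a directed citation graph could be averaged per country.

```python
from collections import defaultdict

# Hypothetical toy data (not from the paper): paper id -> country label.
paper_country = {
    "P1": "US",
    "P2": "US",
    "P3": "CN",
    "P4": "BR",
}

# Invented directed citation edges: (citing paper, cited paper).
citations = [
    ("P2", "P1"),
    ("P3", "P1"),
    ("P4", "P1"),
    ("P1", "P3"),
    ("P4", "P3"),
]


def mean_citations_by_country(paper_country, citations):
    """Return {country: mean number of incoming citations per paper}."""
    # Count incoming citations (in-degree) for every cited paper.
    in_degree = defaultdict(int)
    for _citing, cited in citations:
        in_degree[cited] += 1

    # Aggregate citation totals and paper counts per country.
    totals = defaultdict(int)
    counts = defaultdict(int)
    for paper, country in paper_country.items():
        totals[country] += in_degree[paper]
        counts[country] += 1

    return {country: totals[country] / counts[country] for country in counts}


result = mean_citations_by_country(paper_country, citations)
print(result)
```

A comparison that controls for venue or sub-field, as the abstract describes, would additionally group papers by those attributes before averaging; that step is omitted here.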

