富国和富国代表 (Richer Countries and Richer Representations) - 专知论文

会员服务 ·

0

Less · Cocoa · Performer · 表示 · 相关系数 ·

2022 年 5 月 10 日

Richer Countries and Richer Representations

翻译：富国和富国代表

Kaitlyn Zhou,Kawin Ethayarajh,Dan Jurafsky

from arxiv, Camera Ready for ACL 2022 (Findings)

We examine whether some countries are more richly represented in embedding space than others. We find that countries whose names occur with low frequency in training corpora are more likely to be tokenized into subwords, are less semantically distinct in embedding space, and are less likely to be correctly predicted: e.g., Ghana (the correct answer and in-vocabulary) is not predicted for, "The country producing the most cocoa is [MASK].". Although these performance discrepancies and representational harms are due to frequency, we find that frequency is highly correlated with a country's GDP; thus perpetuating historic power and wealth inequalities. We analyze the effectiveness of mitigation strategies; recommend that researchers report training word frequencies; and recommend future work for the community to define and design representational guarantees.

翻译：我们研究一些国家在嵌入空间方面是否比其他国家更具有更丰富的代表性。我们发现,在培训公司中低频率出现国名的国家更有可能被象征成子字,在嵌入空间方面不太具有内在的区别,也不太可能得到正确预测:例如,加纳(正确的答案和在词汇中)没有预测到“生产可可最多的国家是[MASK]”。虽然这些绩效差异和代表性伤害是频繁造成的,但我们发现,频率与一个国家的GDP密切相关,从而延续了历史权力和财富不平等。我们分析了缓解战略的有效性;建议研究人员报告语言频率培训;建议社区今后界定和设计代表性保障的工作。

0

相关内容

Less

LESS 是一个开源的样式语言，受到 Sass 的影响。严格来说，LESS 是一个嵌套的元语言，符合语法规范的 CSS 语句也是符合规范的 Less 代码。

33页PPT【AI+天气预测】，AI and Machine learning for weather predictions

33页PPT【AI+天气预测】，AI and Machine learning for weather predictions

专知会员服务

35+阅读 · 2022年3月5日

剑桥大学《数据科学: 原理与实践》课程，附PPT下载

剑桥大学《数据科学: 原理与实践》课程，附PPT下载

专知会员服务

54+阅读 · 2021年1月20日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

2019年机器学习框架回顾

2019年机器学习框架回顾

专知会员服务

36+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

二氢丹参酮I对多药耐药结肠癌及胃肠肿瘤活化成纤维细胞中代谢重编程的调控作用及机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

Anderson型多酸的不对称修饰及可控组装研究

国家自然科学基金

1+阅读 · 2014年12月31日

nAChR/Prx1轴在烟草相关口腔白斑细胞凋亡中的作用及机制

国家自然科学基金

0+阅读 · 2014年12月31日

基于甾体激素受体核转录调控途径白术黄酮苷AMFG-5改善高雄诱导的卵泡发育障碍的作用机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

Beclin 1在阿尔茨海默病样神经元损伤中的调控机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

结合连锁和关联分析方法剖析玉米硝酸还原酶的遗传机制

国家自然科学基金

0+阅读 · 2013年12月31日

RI与Angiogenin相互作用调控PI3K/AKT/mTOR信号通路和ANG的核转位在膀胱癌发生发展中的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

低温等离子体物理及应用战略研讨

国家自然科学基金

0+阅读 · 2012年8月31日

CyPA在OX40-OX40L受体-配体轴调控动脉粥样斑块形成中的作用及机制

国家自然科学基金

0+阅读 · 2011年12月31日

SNAREs对动脉内皮细胞凋亡增殖的影响及其在动脉粥样硬化发病机制中的作用研究

国家自然科学基金

0+阅读 · 2009年12月31日

BYOL-S: Learning Self-supervised Speech Representations by Bootstrapping

BYOL-S: Learning Self-supervised Speech Representations by Bootstrapping

Arxiv

0+阅读 · 2022年6月30日

Where to Begin? Exploring the Impact of Pre-Training and Initialization in Federated Learning

Arxiv

0+阅读 · 2022年6月30日

Visual grounding of abstract and concrete words: A response to Günther et al. (2020)

Arxiv

0+阅读 · 2022年6月30日

Modeling Teams Performance Using Deep Representational Learning on Graphs

Arxiv

0+阅读 · 2022年6月29日

Automatic Detection of Speech Sound Disorder in Child Speech Using Posterior-based Speaker Representations

Arxiv

0+阅读 · 2022年6月29日

Fire Together Wire Together: A Dynamic Pruning Approach with Self-Supervised Mask Prediction

Arxiv

0+阅读 · 2022年6月29日

EvilModel 2.0: Bringing Neural Network Models into Malware Attacks

Arxiv

0+阅读 · 2022年6月28日

Learning the Evolutionary and Multi-scale Graph Structure for Multivariate Time Series Forecasting

Arxiv

0+阅读 · 2022年6月28日

Learning and Evaluating Graph Neural Network Explanations based on Counterfactual and Factual Reasoning

Arxiv

17+阅读 · 2022年2月17日

From Knowledge Graph Embedding to Ontology Embedding: Region Based Representations of Relational Structures

Arxiv

10+阅读 · 2018年5月26日

VIP会员

文章信息

相关主题

相关VIP内容

33页PPT【AI+天气预测】，AI and Machine learning for weather predictions

33页PPT【AI+天气预测】，AI and Machine learning for weather predictions

专知会员服务

35+阅读 · 2022年3月5日

剑桥大学《数据科学: 原理与实践》课程，附PPT下载

剑桥大学《数据科学: 原理与实践》课程，附PPT下载

专知会员服务

54+阅读 · 2021年1月20日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

2019年机器学习框架回顾

2019年机器学习框架回顾

专知会员服务

36+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【牛津博士论文】面向视觉、物理与语言应用的可信机器学习模型

医学领域大型语言模型的新进展

战场AI决策支持系统

【NeurIPS 2025】视觉指令瓶颈微调

相关资讯

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

相关论文

BYOL-S: Learning Self-supervised Speech Representations by Bootstrapping

BYOL-S: Learning Self-supervised Speech Representations by Bootstrapping

Arxiv

0+阅读 · 2022年6月30日

Where to Begin? Exploring the Impact of Pre-Training and Initialization in Federated Learning

Arxiv

0+阅读 · 2022年6月30日

Visual grounding of abstract and concrete words: A response to Günther et al. (2020)

Arxiv

0+阅读 · 2022年6月30日

Modeling Teams Performance Using Deep Representational Learning on Graphs

Arxiv

0+阅读 · 2022年6月29日

Automatic Detection of Speech Sound Disorder in Child Speech Using Posterior-based Speaker Representations

Arxiv

0+阅读 · 2022年6月29日

Fire Together Wire Together: A Dynamic Pruning Approach with Self-Supervised Mask Prediction

Arxiv

0+阅读 · 2022年6月29日

EvilModel 2.0: Bringing Neural Network Models into Malware Attacks

Arxiv

0+阅读 · 2022年6月28日

Learning the Evolutionary and Multi-scale Graph Structure for Multivariate Time Series Forecasting

Arxiv

0+阅读 · 2022年6月28日

Learning and Evaluating Graph Neural Network Explanations based on Counterfactual and Factual Reasoning

Arxiv

17+阅读 · 2022年2月17日

From Knowledge Graph Embedding to Ontology Embedding: Region Based Representations of Relational Structures

Arxiv

10+阅读 · 2018年5月26日

相关基金

二氢丹参酮I对多药耐药结肠癌及胃肠肿瘤活化成纤维细胞中代谢重编程的调控作用及机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

Anderson型多酸的不对称修饰及可控组装研究

国家自然科学基金

1+阅读 · 2014年12月31日

nAChR/Prx1轴在烟草相关口腔白斑细胞凋亡中的作用及机制

国家自然科学基金

0+阅读 · 2014年12月31日

基于甾体激素受体核转录调控途径白术黄酮苷AMFG-5改善高雄诱导的卵泡发育障碍的作用机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

Beclin 1在阿尔茨海默病样神经元损伤中的调控机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

结合连锁和关联分析方法剖析玉米硝酸还原酶的遗传机制

国家自然科学基金

0+阅读 · 2013年12月31日

RI与Angiogenin相互作用调控PI3K/AKT/mTOR信号通路和ANG的核转位在膀胱癌发生发展中的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

低温等离子体物理及应用战略研讨

国家自然科学基金

0+阅读 · 2012年8月31日

CyPA在OX40-OX40L受体-配体轴调控动脉粥样斑块形成中的作用及机制

国家自然科学基金

0+阅读 · 2011年12月31日

SNAREs对动脉内皮细胞凋亡增殖的影响及其在动脉粥样硬化发病机制中的作用研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员