BERT中减少依赖语言的族裔偏见 (Mitigating Language-Dependent Ethnic Bias in BERT) - 专知论文

会员服务 ·

0

有偏 · BERT · MoDELS · 语言模型化 · Better ·

2021 年 9 月 13 日

Mitigating Language-Dependent Ethnic Bias in BERT

翻译：BERT中减少依赖语言的族裔偏见

Jaimeen Ahn,Alice Oh

from arxiv, 17 pages including references and appendix. To be appear in EMNLP 2021 (camera-ready ver.)

BERT and other large-scale language models (LMs) contain gender and racial bias. They also exhibit other dimensions of social bias, most of which have not been studied in depth, and some of which vary depending on the language. In this paper, we study ethnic bias and how it varies across languages by analyzing and mitigating ethnic bias in monolingual BERT for English, German, Spanish, Korean, Turkish, and Chinese. To observe and quantify ethnic bias, we develop a novel metric called Categorical Bias score. Then we propose two methods for mitigation; first using a multilingual model, and second using contextual word alignment of two monolingual models. We compare our proposed methods with monolingual BERT and show that these methods effectively alleviate the ethnic bias. Which of the two methods works better depends on the amount of NLP resources available for that language. We additionally experiment with Arabic and Greek to verify that our proposed methods work for a wider variety of languages.

翻译：BERT和其他大型语言模式含有性别和种族偏见,它们也表现出社会偏见的其他方面,其中多数尚未深入研究,有些则因语言而异。在本文中,我们研究族裔偏见以及不同语言的种族偏见,方法是用英语、德语、西班牙语、韩语、土耳其语和中文单语语言语言语言的BERT来分析和减少族裔偏见。为了观察和量化族裔偏见,我们开发了一个叫作分类比亚斯分的新颖的衡量标准。然后我们提出两种缓解方法;首先使用多语种模式,其次是使用两种单一语言模式的背景词对齐。我们比较了我们提出的方法,并表明这些方法有效地缓解了种族偏见。这两种方法中的哪一种方法更适合用于该语言的NLP资源量。我们还与阿拉伯语和希腊语进行了进一步试验,以核实我们提出的方法是否适用于更广泛的语言。

0

相关内容

【干货书】计算成像，483页pdf，Computational Imaging Book, MIT 出版社

专知会员服务

65+阅读 · 2021年9月12日

不可错过！CMU《机器学习导论》2021课程，ML祖师爷Tom Mitchell带队主讲

不可错过！CMU《机器学习导论》2021课程，ML祖师爷Tom Mitchell带队主讲

专知会员服务

64+阅读 · 2021年3月20日

最新《Transformers模型》教程，64页ppt

最新《Transformers模型》教程，64页ppt

专知会员服务

321+阅读 · 2020年11月26日

因果图，Causal Graphs，52页ppt

因果图，Causal Graphs，52页ppt

专知会员服务

250+阅读 · 2020年4月19日

【MIT深度学习课程】深度序列建模，Deep Sequence Modeling

【MIT深度学习课程】深度序列建模，Deep Sequence Modeling

专知会员服务

78+阅读 · 2020年2月3日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

【斯坦福大学AAAI2020】跨越因果层次的概率推理，Probabilistic Reasoning across the Causal Hierarchy

【斯坦福大学AAAI2020】跨越因果层次的概率推理，Probabilistic Reasoning across the Causal Hierarchy

专知会员服务

46+阅读 · 2020年1月11日

预训练语言模型BERT，Jacob Devlin斯坦福演讲PPT：BERT介绍与答疑，35页ppt

预训练语言模型BERT，Jacob Devlin斯坦福演讲PPT：BERT介绍与答疑，35页ppt

专知会员服务

112+阅读 · 2020年1月7日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

MIT新书《强化学习与最优控制》

MIT新书《强化学习与最优控制》

专知会员服务

280+阅读 · 2019年10月9日

BERT系列文章汇总导读

BERT系列文章汇总导读

AINLP

12+阅读 · 2019年8月19日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

BERT烹饪之法：fintune 的艺术

BERT烹饪之法：fintune 的艺术

大数据文摘

4+阅读 · 2019年4月20日

Call for Participation: Shared Tasks in NLPCC 2019

Call for Participation: Shared Tasks in NLPCC 2019

中国计算机学会

5+阅读 · 2019年3月22日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

meta learning 17年：MAML SNAIL

meta learning 17年：MAML SNAIL

CreateAMind

11+阅读 · 2019年1月2日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

自然语言处理顶会EMNLP2018接受论文列表！

自然语言处理顶会EMNLP2018接受论文列表！

专知

87+阅读 · 2018年8月26日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

LMdiff: A Visual Diff Tool to Compare Language Models

Arxiv

0+阅读 · 2021年11月2日

Towards Language Modelling in the Speech Domain Using Sub-word Linguistic Units

Arxiv

0+阅读 · 2021年10月31日

Identifying and mitigating bias in algorithms used to manage patients in a pandemic

Arxiv

0+阅读 · 2021年10月30日

DSEE: Dually Sparsity-embedded Efficient Tuning of Pre-trained Language Models

Arxiv

0+阅读 · 2021年10月30日

Counterfactual VQA: A Cause-Effect Look at Language Bias

Arxiv

16+阅读 · 2020年12月28日

Equivalent Causal Models

Arxiv

5+阅读 · 2020年12月10日

Natural Language Inference in Context -- Investigating Contextual Reasoning over Long Texts

Arxiv

6+阅读 · 2020年11月10日

A Survey on Causal Inference

Arxiv

112+阅读 · 2020年2月5日

Commonsense Knowledge + BERT for Level 2 Reading Comprehension Ability Test

Arxiv

4+阅读 · 2019年9月8日

Neural Network Models for Paraphrase Identification, Semantic Textual Similarity, Natural Language Inference, and Question Answering

Arxiv

7+阅读 · 2018年6月12日

VIP会员

文章信息

相关主题

语言模型化

相关VIP内容

【干货书】计算成像，483页pdf，Computational Imaging Book, MIT 出版社

专知会员服务

65+阅读 · 2021年9月12日

不可错过！CMU《机器学习导论》2021课程，ML祖师爷Tom Mitchell带队主讲

不可错过！CMU《机器学习导论》2021课程，ML祖师爷Tom Mitchell带队主讲

专知会员服务

64+阅读 · 2021年3月20日

最新《Transformers模型》教程，64页ppt

最新《Transformers模型》教程，64页ppt

专知会员服务

321+阅读 · 2020年11月26日

因果图，Causal Graphs，52页ppt

因果图，Causal Graphs，52页ppt

专知会员服务

250+阅读 · 2020年4月19日

【MIT深度学习课程】深度序列建模，Deep Sequence Modeling

【MIT深度学习课程】深度序列建模，Deep Sequence Modeling

专知会员服务

78+阅读 · 2020年2月3日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

【斯坦福大学AAAI2020】跨越因果层次的概率推理，Probabilistic Reasoning across the Causal Hierarchy

【斯坦福大学AAAI2020】跨越因果层次的概率推理，Probabilistic Reasoning across the Causal Hierarchy

专知会员服务

46+阅读 · 2020年1月11日

预训练语言模型BERT，Jacob Devlin斯坦福演讲PPT：BERT介绍与答疑，35页ppt

预训练语言模型BERT，Jacob Devlin斯坦福演讲PPT：BERT介绍与答疑，35页ppt

专知会员服务

112+阅读 · 2020年1月7日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

MIT新书《强化学习与最优控制》

MIT新书《强化学习与最优控制》

专知会员服务

280+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《使用量化测量将传感器节点关联到融合中心的算法设计》171页

军事前沿模型

提升军事训练能力的最佳人工智能模拟工具

《社交媒体信息作战》最新48页技术报告

相关资讯

BERT系列文章汇总导读

BERT系列文章汇总导读

AINLP

12+阅读 · 2019年8月19日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

BERT烹饪之法：fintune 的艺术

BERT烹饪之法：fintune 的艺术

大数据文摘

4+阅读 · 2019年4月20日

Call for Participation: Shared Tasks in NLPCC 2019

Call for Participation: Shared Tasks in NLPCC 2019

中国计算机学会

5+阅读 · 2019年3月22日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

meta learning 17年：MAML SNAIL

meta learning 17年：MAML SNAIL

CreateAMind

11+阅读 · 2019年1月2日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

自然语言处理顶会EMNLP2018接受论文列表！

自然语言处理顶会EMNLP2018接受论文列表！

专知

87+阅读 · 2018年8月26日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

相关论文

LMdiff: A Visual Diff Tool to Compare Language Models

Arxiv

0+阅读 · 2021年11月2日

Towards Language Modelling in the Speech Domain Using Sub-word Linguistic Units

Arxiv

0+阅读 · 2021年10月31日

Identifying and mitigating bias in algorithms used to manage patients in a pandemic

Arxiv

0+阅读 · 2021年10月30日

DSEE: Dually Sparsity-embedded Efficient Tuning of Pre-trained Language Models

Arxiv

0+阅读 · 2021年10月30日

Counterfactual VQA: A Cause-Effect Look at Language Bias

Arxiv

16+阅读 · 2020年12月28日

Equivalent Causal Models

Arxiv

5+阅读 · 2020年12月10日

Natural Language Inference in Context -- Investigating Contextual Reasoning over Long Texts

Arxiv

6+阅读 · 2020年11月10日

A Survey on Causal Inference

Arxiv

112+阅读 · 2020年2月5日

Commonsense Knowledge + BERT for Level 2 Reading Comprehension Ability Test

Arxiv

4+阅读 · 2019年9月8日

Neural Network Models for Paraphrase Identification, Semantic Textual Similarity, Natural Language Inference, and Question Answering

Arxiv

7+阅读 · 2018年6月12日

微信扫码咨询专知VIP会员