Unfair stereotypical biases (e.g., gender, racial, or religious biases) encoded in modern pretrained language models (PLMs) have negative ethical implications for widespread adoption of state-of-the-art language technology. To remedy this, a wide range of debiasing techniques have recently been introduced to remove such stereotypical biases from PLMs. Existing debiasing methods, however, directly modify all of the PLM's parameters, which -- besides being computationally expensive -- comes with the inherent risk of (catastrophic) forgetting of useful language knowledge acquired in pretraining. In this work, we propose a more sustainable modular debiasing approach based on dedicated debiasing adapters, dubbed ADELE. Concretely, we (1) inject adapter modules into the original PLM layers and (2) update only the adapters (i.e., we keep the original PLM parameters frozen) via language modeling training on a counterfactually augmented corpus. We showcase ADELE in gender debiasing of BERT: our extensive evaluation, encompassing three intrinsic and two extrinsic bias measures, renders ADELE very effective in bias mitigation. We further show that -- due to its modular nature -- ADELE, coupled with task adapters, retains fairness even after large-scale downstream training. Finally, by means of multilingual BERT, we successfully transfer ADELE to six target languages.
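The following is a minimal illustrative sketch of the adapter-based debiasing recipe described above (adapters injected into a frozen BERT, trained with masked language modeling on counterfactually augmented text). It is not the authors' implementation: the bottleneck adapter, its size, the learning rate, the forward-hook injection mechanism, and the tiny word-swap list for counterfactual augmentation are all assumptions made for illustration.

```python
# Sketch of adapter-based debiasing: bottleneck adapters (an assumption; the exact
# adapter architecture may differ) are added after each BERT layer via forward hooks,
# all original BERT parameters are frozen, and only the adapters are trained with
# MLM on a counterfactually augmented corpus.
import torch
import torch.nn as nn
from transformers import BertTokenizerFast, BertForMaskedLM, DataCollatorForLanguageModeling

MODEL_NAME = "bert-base-uncased"
tokenizer = BertTokenizerFast.from_pretrained(MODEL_NAME)
model = BertForMaskedLM.from_pretrained(MODEL_NAME)

# (2) Keep the original PLM parameters frozen.
for p in model.parameters():
    p.requires_grad = False

class BottleneckAdapter(nn.Module):
    """Down-project -> nonlinearity -> up-project, added residually."""
    def __init__(self, hidden_size: int, bottleneck: int = 48):  # bottleneck size is an assumption
        super().__init__()
        self.down = nn.Linear(hidden_size, bottleneck)
        self.up = nn.Linear(bottleneck, hidden_size)
        self.act = nn.ReLU()

    def forward(self, hidden_states):
        return hidden_states + self.up(self.act(self.down(hidden_states)))

# (1) Inject one adapter after each encoder layer via forward hooks.
adapters = nn.ModuleList(
    [BottleneckAdapter(model.config.hidden_size) for _ in model.bert.encoder.layer]
)

def make_hook(adapter):
    def hook(module, inputs, outputs):
        # A BertLayer returns a tuple whose first element is the hidden states.
        return (adapter(outputs[0]),) + outputs[1:]
    return hook

for layer, adapter in zip(model.bert.encoder.layer, adapters):
    layer.register_forward_hook(make_hook(adapter))

# Counterfactual augmentation: a toy gender word-swap list (illustrative only).
SWAPS = {"he": "she", "she": "he", "his": "her", "her": "his"}
def counterfactual(sentence: str) -> str:
    return " ".join(SWAPS.get(tok, tok) for tok in sentence.split())

corpus = ["he worked as a doctor .", "she stayed home with her children ."]
augmented = corpus + [counterfactual(s) for s in corpus]

# Language modeling training that updates only the adapter parameters.
collator = DataCollatorForLanguageModeling(tokenizer, mlm_probability=0.15)
optimizer = torch.optim.AdamW(adapters.parameters(), lr=1e-4)  # lr is an assumption

model.train()
batch = collator([tokenizer(s) for s in augmented])
optimizer.zero_grad()
loss = model(**batch).loss
loss.backward()
optimizer.step()
```

Because the base model never changes, the trained adapter weights form a small, reusable debiasing module that can later be stacked with task-specific adapters, which is the property the abstract refers to as modularity.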