Astraea: Grammar-based Fairness Testing - Zhuanzhi (专知) Papers


January 10, 2022

Astraea: Grammar-based Fairness Testing


Ezekiel Soremekun, Sakshi Udeshi, Sudipta Chattopadhyay

From arXiv; IEEE Transactions on Software Engineering (2022)

Software often produces biased outputs. In particular, machine learning (ML) based software is known to produce erroneous predictions when processing discriminatory inputs. Such unfair program behavior can be caused by societal bias. In the last few years, Amazon, Microsoft and Google have provided software services that produce unfair outputs, mostly due to societal bias (e.g., gender or race). In such events, developers are saddled with the task of conducting fairness testing. Fairness testing is challenging; developers are tasked with generating discriminatory inputs that reveal and explain biases. We propose a grammar-based fairness testing approach (called ASTRAEA) which leverages context-free grammars to generate discriminatory inputs that reveal fairness violations in software systems. Using probabilistic grammars, ASTRAEA also provides fault diagnosis by isolating the cause of observed software bias. ASTRAEA's diagnoses facilitate the improvement of ML fairness. ASTRAEA was evaluated on 18 software systems that provide three major natural language processing (NLP) services. In our evaluation, ASTRAEA generated fairness violations at a rate of ~18%. ASTRAEA generated over 573K discriminatory test cases and found over 102K fairness violations. Furthermore, ASTRAEA improves software fairness by ~76% via model retraining.
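To make the idea of grammar-based generation of discriminatory inputs more concrete, here is a minimal Python sketch. It is not ASTRAEA's implementation: the toy grammar, the `SENSITIVE` nonterminal, the helper names (`expand`, `generate_pairs`, `find_violations`), and the deliberately biased `toy_sentiment` function are all hypothetical and chosen only for illustration. The sketch derives sentence pairs from a small context-free grammar so that the two sentences differ only in a sensitive attribute, then flags a pair as a fairness violation when the software under test returns different outputs for them.

```python
import itertools

# Hypothetical toy grammar (an assumption, not taken from the paper).
# Each nonterminal maps to a list of alternatives; each alternative is a
# sequence of terminals and nonterminals.
GRAMMAR = {
    "<sentence>": [["The", "<person>", "is", "a", "<occupation>"]],
    "<person>": [["man"], ["woman"]],  # sensitive attribute (gender)
    "<occupation>": [["doctor"], ["nurse"], ["pilot"]],
}
SENSITIVE = "<person>"


def expand(symbol, pinned):
    """Expand `symbol` into tokens using the alternative pinned per nonterminal."""
    if symbol not in GRAMMAR:
        return [symbol]  # terminal token
    tokens = []
    for part in pinned[symbol]:
        tokens.extend(expand(part, pinned))
    return tokens


def generate_pairs():
    """Yield sentence pairs that differ only in the sensitive nonterminal."""
    others = [nt for nt in GRAMMAR if nt != SENSITIVE]
    # Enumerate every combination of alternatives for the non-sensitive parts.
    for combo in itertools.product(*(GRAMMAR[nt] for nt in others)):
        pinned = dict(zip(others, combo))
        # Vary only the sensitive nonterminal between the two sentences.
        for alt_a, alt_b in itertools.combinations(GRAMMAR[SENSITIVE], 2):
            sent_a = " ".join(expand("<sentence>", {**pinned, SENSITIVE: alt_a}))
            sent_b = " ".join(expand("<sentence>", {**pinned, SENSITIVE: alt_b}))
            yield sent_a, sent_b


def toy_sentiment(sentence):
    """Deliberately biased stand-in for the software under test (not a real model)."""
    if "woman" in sentence and "pilot" in sentence:
        return "negative"
    return "neutral"


def find_violations(model):
    """Return the pairs for which the model's output changes with the sensitive attribute."""
    return [(a, b) for a, b in generate_pairs() if model(a) != model(b)]


if __name__ == "__main__":
    pairs = list(generate_pairs())
    violations = find_violations(toy_sentiment)
    print(f"{len(violations)} of {len(pairs)} pairs violate fairness")
    for a, b in violations:
        print(f"  {a!r} vs {b!r}")
```

In this sketch the grammar is enumerated exhaustively, which only works for tiny grammars. A probabilistic grammar, as mentioned in the abstract, would instead attach weights to the alternatives and sample from them; that part is not shown here.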

