Dbias: Detecting biases and ensuring Fairness in news articles


August 11, 2022


Shaina Raza, Deepak John Reji, Chen Ding

From arXiv; accepted for publication in the International Journal of Data Science and Analytics

Because data-centric systems and algorithms are increasingly used in machine learning, fairness is receiving considerable attention in the academic and broader literature. This paper introduces Dbias (https://pypi.org/project/Dbias/), an open-source Python package for ensuring fairness in news articles. Dbias takes any text and determines whether it is biased. It then detects the biased words in the text, masks them, and suggests a set of sentences with new words that are bias-free or at least less biased. We conduct extensive experiments to assess the performance of Dbias. To see how well our approach works, we compare it with existing fairness models. We also test the individual components of Dbias to see how effective they are. The experimental results show that Dbias outperforms all the baselines in terms of accuracy and fairness. We make this package publicly available so that developers and practitioners can mitigate biases in textual data (such as news articles), and to encourage extension of this work.
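The four stages named in the abstract (decide whether a text is biased, find the biased words, mask them, suggest less biased sentences) can be illustrated with a toy sketch. This is not the Dbias API and does not reproduce the authors' models: the lexicon `NEUTRAL_ALTERNATIVES`, the `[MASK]` token, and the functions `recognize`, `is_biased`, `mask`, and `suggest` are all hypothetical names invented for illustration, and a hand-written word list stands in for whatever learned components the package uses.

```python
import re
from itertools import product

# Hypothetical lexicon (an assumption, not from the paper):
# biased word -> less loaded alternatives.
NEUTRAL_ALTERNATIVES = {
    "radical": ["far-reaching", "major"],
    "slammed": ["criticized"],
    "disastrous": ["unsuccessful"],
}

MASK = "[MASK]"                    # hypothetical mask token
WORD = re.compile(r"[A-Za-z']+")   # crude word matcher for the toy example


def _in_lexicon(word):
    return word.lower() in NEUTRAL_ALTERNATIVES


def recognize(text):
    """Return the biased words found in text, in order of appearance."""
    return [w for w in WORD.findall(text) if _in_lexicon(w)]


def is_biased(text):
    """Toy classifier: a text counts as biased if it has any lexicon word."""
    return bool(recognize(text))


def mask(text):
    """Replace every biased word with the mask token."""
    return WORD.sub(lambda m: MASK if _in_lexicon(m.group(0)) else m.group(0), text)


def suggest(text):
    """Return one candidate sentence per combination of alternatives."""
    options = [NEUTRAL_ALTERNATIVES[w.lower()] for w in recognize(text)]
    if not options:
        return [text]
    suggestions = []
    for combo in product(*options):
        fill = iter(combo)  # consumed left to right, matching recognize() order
        suggestions.append(
            WORD.sub(lambda m: next(fill) if _in_lexicon(m.group(0)) else m.group(0), text)
        )
    return suggestions


if __name__ == "__main__":
    sentence = "The senator slammed the radical plan."
    print(is_biased(sentence))
    print(recognize(sentence))
    print(mask(sentence))
    for candidate in suggest(sentence):
        print(candidate)
```

A lexicon lookup ignores context, so it will miss bias expressed through phrasing and will flag neutral uses of listed words; it only makes the detect, mask, and suggest flow concrete.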


