Although many fairness criteria have been proposed to ensure that machine learning algorithms do not exhibit or amplify our existing social biases, these algorithms are trained on datasets that can themselves be statistically biased. In this paper, we investigate the robustness of a number of existing (demographic) fairness criteria when the algorithm is trained on biased data. We consider two forms of dataset bias: errors by prior decision makers in the labeling process, and errors in the measurement of the features of disadvantaged individuals. We analytically show that some constraints (such as Demographic Parity) can remain robust in the face of certain statistical biases, while others (such as Equalized Odds) are significantly violated when the model is trained on biased data. We also analyze the sensitivity of these criteria, and of the decision maker's utility, to such biases. We provide numerical experiments on three real-world datasets (the FICO credit score, Adult, and German Credit datasets) that support our analytical findings. Our findings offer an additional guideline for choosing among existing fairness criteria, or for proposing new criteria, when available datasets may be biased.
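For reference, the two criteria named in the abstract admit the following standard formal statements; the notation here ($\hat{Y}$ for the algorithm's decision, $Y$ for the true label, $A$ for the group attribute) is ours and is not taken from the paper:

$$\text{Demographic Parity:}\quad \Pr(\hat{Y}=1 \mid A=a) = \Pr(\hat{Y}=1 \mid A=b) \quad \text{for all groups } a, b,$$

$$\text{Equalized Odds:}\quad \Pr(\hat{Y}=1 \mid Y=y, A=a) = \Pr(\hat{Y}=1 \mid Y=y, A=b) \quad \text{for } y \in \{0,1\} \text{ and all groups } a, b.$$

Note that Demographic Parity conditions only on the group attribute $A$, whereas Equalized Odds also conditions on the true label $Y$; this is consistent with the abstract's claim that labeling errors, which corrupt $Y$, can leave the former intact while significantly violating the latter.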