Assessing Fairness in the Presence of Missing Data - Zhuanzhi (专知) Papers



December 7, 2021

Assessing Fairness in the Presence of Missing Data


Yiliang Zhang, Qi Long

Missing data are prevalent and present daunting challenges in real data analysis. While there is a growing body of literature on fairness in the analysis of fully observed data, there has been little theoretical work investigating fairness in the analysis of incomplete data. In practice, a popular analytical approach for dealing with missing data is to use only the set of complete cases, i.e., observations with all features fully observed, to train a prediction algorithm. However, depending on the missing data mechanism, the distribution of complete cases and the distribution of the complete data may be substantially different. When the goal is to develop a fair algorithm in the complete data domain, where there are no missing values, an algorithm that is fair in the complete case domain may show disproportionate bias towards some marginalized groups in the complete data domain. To fill this significant gap, we study the problem of estimating fairness in the complete data domain for an arbitrary model evaluated merely using complete cases. We provide upper and lower bounds on the fairness estimation error and conduct numerical experiments to assess our theoretical results. Our work provides the first known theoretical results on fairness guarantees in the analysis of incomplete data.
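The gap between the complete case domain and the complete data domain can be illustrated with a small simulation. The sketch below is not the authors' method or experiment. It assumes a toy setting with two groups, a single feature, a threshold classifier, and a missingness mechanism that depends on both group and feature value. It uses the demographic parity gap as a stand-in fairness measure, since the abstract does not name a specific one. All function names and parameter values (`simulate`, `parity_gap`, the 0.45 observation probability) are hypothetical choices for illustration.

```python
import random

def positive_rate(records, group):
    """Share of records in `group` that the model labels positive."""
    preds = [yhat for a, yhat, _ in records if a == group]
    return sum(preds) / len(preds)

def parity_gap(records):
    """Demographic parity gap: |P(yhat=1 | A=0) - P(yhat=1 | A=1)|."""
    return abs(positive_rate(records, 0) - positive_rate(records, 1))

def simulate(n=20000, seed=0):
    """Return (group, prediction, fully_observed) triples for n records."""
    rng = random.Random(seed)
    records = []
    for _ in range(n):
        a = 1 if rng.random() < 0.5 else 0
        # Assumed feature distribution: group 1 is shifted upward.
        x = rng.gauss(0.5 if a == 1 else 0.0, 1.0)
        yhat = 1 if x > 0 else 0  # toy threshold classifier
        # Assumed missingness mechanism: in group 1, records with a
        # positive prediction are fully observed only 45% of the time.
        p_observed = 0.45 if (a == 1 and yhat == 1) else 1.0
        observed = rng.random() < p_observed
        records.append((a, yhat, observed))
    return records

records = simulate()
complete_cases = [r for r in records if r[2]]

gap_complete_data = parity_gap(records)
gap_complete_cases = parity_gap(complete_cases)

print(f"records: {len(records)}, complete cases: {len(complete_cases)}")
print(f"parity gap, complete data:  {gap_complete_data:.3f}")
print(f"parity gap, complete cases: {gap_complete_cases:.3f}")
```

Under these assumptions the classifier looks nearly fair when evaluated on complete cases only, while the gap in the complete data is clearly larger. The difference between the two numbers is the kind of fairness estimation error the abstract says the paper bounds.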


