通过边界勘探破坏适应性数据 (Adaptive Data Debiasing through Bounded Exploration) - 专知论文

会员服务 ·

0

Facebook AI Research · 有偏 · 可约的 · Performer · 统计量 ·

2023 年 1 月 10 日

Adaptive Data Debiasing through Bounded Exploration

翻译：通过边界勘探破坏适应性数据

Yifan Yang,Yang Liu,Parinaz Naghizadeh

from arxiv, NeurIPS 2022

Biases in existing datasets used to train algorithmic decision rules can raise ethical and economic concerns due to the resulting disparate treatment of different groups. We propose an algorithm for sequentially debiasing such datasets through adaptive and bounded exploration in a classification problem with costly and censored feedback. Exploration in this context means that at times, and to a judiciously-chosen extent, the decision maker deviates from its (current) loss-minimizing rule, and instead accepts some individuals that would otherwise be rejected, so as to reduce statistical data biases. Our proposed algorithm includes parameters that can be used to balance between the ultimate goal of removing data biases -- which will in turn lead to more accurate and fair decisions, and the exploration risks incurred to achieve this goal. We analytically show that such exploration can help debias data in certain distributions. We further investigate how fairness criteria can work in conjunction with our data debiasing algorithm. We illustrate the performance of our algorithm using experiments on synthetic and real-world datasets.

翻译：用于培训算法决定规则的现有数据集中的比值可能会引起伦理和经济问题,因为由此产生的不同对待不同群体的结果不同。我们提出一种算法,通过在分类问题中以昂贵和受审查的反馈进行适应性和约束性探索,从而按顺序降低这类数据集的偏差。在这方面的探索意味着,有时,并且为了明智地选择,决策者偏离了其(当前)损失最小化规则,而是接受一些否则会被拒绝的个人,以减少统计数据偏差。我们提议的算法包括一些参数,这些参数可以用来平衡消除数据偏差的最终目标 -- -- 这反过来将导致更准确和公正的决定,以及实现这一目标的勘探风险。我们分析表明,这种探索可以帮助某些分布中的数据偏差。我们进一步调查公平标准如何与我们的数据偏差算法相结合。我们用合成和真实世界数据集的实验来说明我们的算法的运作情况。

0

相关内容

Facebook AI Research

Facebook AI Research

Facebook AI Research

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

剑桥大学《数据科学: 原理与实践》课程，附PPT下载

剑桥大学《数据科学: 原理与实践》课程，附PPT下载

专知会员服务

53+阅读 · 2021年1月20日

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

专知会员服务

15+阅读 · 2019年10月23日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

罗巴代数的表示和罗巴代数在operad中的应用

国家自然科学基金

0+阅读 · 2015年12月31日

氮化铌铝/氮化硅双相复合涂层超硬机制和热稳定性研究

国家自然科学基金

0+阅读 · 2014年12月31日

突发事件下人车混合疏散行为及应急疏导策略研究

国家自然科学基金

0+阅读 · 2013年12月31日

调和分析及其应用

国家自然科学基金

0+阅读 · 2013年12月31日

异构酶Pin1启动子多态性与鼻咽癌的关联分析及其表达调控机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

细胞型朊蛋白PrPC在睡眠剥夺与AD发病间发挥关联作用的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

实时安全关键系统的建模、仿真与验证

国家自然科学基金

1+阅读 · 2012年12月31日

图的双临猜想及相关的着色问题

国家自然科学基金

0+阅读 · 2011年12月31日

地下水耦合模型的有限元方法及反演

国家自然科学基金

0+阅读 · 2011年12月31日

Tau蛋白异常对海马神经环路的影响及其在阿尔茨海默病记忆障碍发生发展中的作用

国家自然科学基金

0+阅读 · 2011年12月31日

Refined Pseudo labeling for Source-free Domain Adaptive Object Detection

Arxiv

0+阅读 · 2023年3月7日

FIT: Frequency-based Image Translation for Domain Adaptive Object Detection

Arxiv

0+阅读 · 2023年3月7日

Robust Dominant Periodicity Detection for Time Series with Missing Data

Arxiv

0+阅读 · 2023年3月6日

Large-Scale Exploration of Cave Environments by Unmanned Aerial Vehicles

Arxiv

0+阅读 · 2023年3月6日

Robustness, Evaluation and Adaptation of Machine Learning Models in the Wild

Arxiv

7+阅读 · 2023年3月5日

A Multi-Agent Adaptive Deep Learning Framework for Online Intrusion Detection

Arxiv

0+阅读 · 2023年3月5日

Social Bias Meets Data Bias: The Impacts of Labeling and Measurement Errors on Fairness Criteria

Arxiv

0+阅读 · 2023年3月5日

Adaptive Spatial Sampling Design for Environmental Field Prediction using Low-Cost Sensing Technologies

Arxiv

0+阅读 · 2023年3月3日

Rate adaptive estimation of the center of a symmetric distribution

Arxiv

0+阅读 · 2023年3月3日

Guarded Policy Optimization with Imperfect Online Demonstrations

Arxiv

0+阅读 · 2023年3月3日

VIP会员

文章信息

相关主题

Facebook AI Research

相关VIP内容

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

剑桥大学《数据科学: 原理与实践》课程，附PPT下载

剑桥大学《数据科学: 原理与实践》课程，附PPT下载

专知会员服务

53+阅读 · 2021年1月20日

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

专知会员服务

15+阅读 · 2019年10月23日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【CMU博士论文】数据驱动决策中的激励、信息与不确定性

DGP双粒度提示框架：图增强大模型助力欺诈检测

【ICCV2025】ESSENTIAL：用于视频类增量学习的情景记忆与语义记忆整合

唯快不破：大型语言模型高效架构综述

相关资讯

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

相关论文

Refined Pseudo labeling for Source-free Domain Adaptive Object Detection

Arxiv

0+阅读 · 2023年3月7日

FIT: Frequency-based Image Translation for Domain Adaptive Object Detection

Arxiv

0+阅读 · 2023年3月7日

Robust Dominant Periodicity Detection for Time Series with Missing Data

Arxiv

0+阅读 · 2023年3月6日

Large-Scale Exploration of Cave Environments by Unmanned Aerial Vehicles

Arxiv

0+阅读 · 2023年3月6日

Robustness, Evaluation and Adaptation of Machine Learning Models in the Wild

Arxiv

7+阅读 · 2023年3月5日

A Multi-Agent Adaptive Deep Learning Framework for Online Intrusion Detection

Arxiv

0+阅读 · 2023年3月5日

Social Bias Meets Data Bias: The Impacts of Labeling and Measurement Errors on Fairness Criteria

Arxiv

0+阅读 · 2023年3月5日

Adaptive Spatial Sampling Design for Environmental Field Prediction using Low-Cost Sensing Technologies

Arxiv

0+阅读 · 2023年3月3日

Rate adaptive estimation of the center of a symmetric distribution

Arxiv

0+阅读 · 2023年3月3日

Guarded Policy Optimization with Imperfect Online Demonstrations

Arxiv

0+阅读 · 2023年3月3日

相关基金

罗巴代数的表示和罗巴代数在operad中的应用

国家自然科学基金

0+阅读 · 2015年12月31日

氮化铌铝/氮化硅双相复合涂层超硬机制和热稳定性研究

国家自然科学基金

0+阅读 · 2014年12月31日

突发事件下人车混合疏散行为及应急疏导策略研究

国家自然科学基金

0+阅读 · 2013年12月31日

调和分析及其应用

国家自然科学基金

0+阅读 · 2013年12月31日

异构酶Pin1启动子多态性与鼻咽癌的关联分析及其表达调控机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

细胞型朊蛋白PrPC在睡眠剥夺与AD发病间发挥关联作用的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

实时安全关键系统的建模、仿真与验证

国家自然科学基金

1+阅读 · 2012年12月31日

图的双临猜想及相关的着色问题

国家自然科学基金

0+阅读 · 2011年12月31日

地下水耦合模型的有限元方法及反演

国家自然科学基金

0+阅读 · 2011年12月31日

Tau蛋白异常对海马神经环路的影响及其在阿尔茨海默病记忆障碍发生发展中的作用

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员