As machine learning becomes prevalent, mitigating any unfairness present in the training data becomes critical. Among the various notions of fairness, this paper focuses on the well-known notion of individual fairness, which states that similar individuals should be treated similarly. While individual fairness can be improved when training a model (in-processing), we contend that fixing the data before model training (pre-processing) is a more fundamental solution. In particular, we show that label flipping is an effective pre-processing technique for improving individual fairness. Our system iFlipper solves the optimization problem of minimally flipping labels given a limit on the number of individual fairness violations, where a violation occurs when two similar examples in the training data have different labels. We first prove that this problem is NP-hard. We then propose an approximate linear programming algorithm and provide theoretical guarantees on how close its result is to the optimal solution in terms of the number of label flips. We also propose techniques for further improving the linear programming solution without exceeding the violation limit. Experiments on real datasets show that iFlipper significantly outperforms other pre-processing baselines in terms of individual fairness and accuracy on unseen test sets. In addition, iFlipper can be combined with in-processing techniques for even better results.
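To make the optimization concrete, the following is a minimal sketch of the label-flipping problem as a linear programming relaxation, matching the abstract's description of minimizing flips under a violation budget. The function name, the use of the PuLP library, and the representation of similarity as an explicit pair list are all assumptions for illustration; the paper's actual formulation and its step for rounding the relaxed labels back to integers are not shown here.

```python
# Illustrative sketch only: the variable names, the similarity-pair input,
# and the PuLP-based solving are our assumptions, not iFlipper's actual code.
import pulp

def flip_labels_lp(y, similar_pairs, max_violations):
    """Relaxed label flipping: minimize the total label change subject to a
    budget on fairness violations (similar pairs with different labels).

    y              : list of binary labels (0/1)
    similar_pairs  : list of (i, j) index pairs deemed similar
    max_violations : upper bound on the total violation amount
    """
    n = len(y)
    prob = pulp.LpProblem("min_label_flips", pulp.LpMinimize)

    # Relaxed labels y'_i in [0, 1]; rounding them back to {0, 1} without
    # exceeding the budget is the separate post-processing step.
    y_new = [pulp.LpVariable(f"y_{i}", 0, 1) for i in range(n)]
    # d_i >= |y'_i - y_i|, linearized with two inequalities per example.
    d = [pulp.LpVariable(f"d_{i}", 0, 1) for i in range(n)]
    # v_ij >= |y'_i - y'_j| measures the violation on each similar pair.
    v = {(i, j): pulp.LpVariable(f"v_{i}_{j}", 0, 1) for (i, j) in similar_pairs}

    # Objective: total label change, i.e., the (relaxed) number of flips.
    prob += pulp.lpSum(d)

    for i in range(n):
        prob += d[i] >= y_new[i] - y[i]
        prob += d[i] >= y[i] - y_new[i]
    for (i, j) in similar_pairs:
        prob += v[(i, j)] >= y_new[i] - y_new[j]
        prob += v[(i, j)] >= y_new[j] - y_new[i]
    # Budget constraint: total violation amount stays within the limit.
    prob += pulp.lpSum(v.values()) <= max_violations

    prob.solve(pulp.PULP_CBC_CMD(msg=False))
    return [pulp.value(var) for var in y_new]
```

For example, with `y = [0, 1, 0]`, `similar_pairs = [(0, 1), (1, 2)]`, and `max_violations = 0`, the sketch flips example 1's label to 0, the single cheapest change that eliminates both violations.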