Revisiting the Importance of Amplifying Bias for Debiasing

December 7, 2022


Jungsoo Lee, Jeonghoon Park, Daeyoung Kim, Juyoung Lee, Edward Choi, Jaegul Choo

From arXiv; accepted to AAAI 2023.

In image classification, "debiasing" aims to train a classifier to be less susceptible to dataset bias, that is, a strong correlation between peripheral attributes of data samples and a target class. For example, even if the frog class in the dataset mainly consists of frog images with a swamp background (i.e., bias-aligned samples), a debiased classifier should be able to correctly classify a frog at a beach (i.e., a bias-conflicting sample). Recent debiasing approaches commonly use two components: a biased model $f_B$ and a debiased model $f_D$. $f_B$ is trained to focus on bias-aligned samples (i.e., it is overfitted to the bias), while $f_D$ is mainly trained on bias-conflicting samples by concentrating on the samples that $f_B$ fails to learn, which makes $f_D$ less susceptible to the dataset bias. While state-of-the-art debiasing techniques have aimed to better train $f_D$, we focus on training $f_B$, a component that has been overlooked until now. Our empirical analysis reveals that removing the bias-conflicting samples from the training set for $f_B$ is important for improving the debiasing performance of $f_D$. The reason is that bias-conflicting samples do not include the bias attribute, so they act as noisy samples when amplifying the bias of $f_B$. To this end, we propose a simple yet effective data sample selection method that removes the bias-conflicting samples to construct a bias-amplified dataset for training $f_B$. Our data sample selection method can be directly applied to existing reweighting-based debiasing approaches, yielding a consistent performance boost and achieving state-of-the-art performance on both synthetic and real-world datasets.
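The two ideas in the abstract, filtering the training set of $f_B$ and reweighting samples for $f_D$, can be illustrated with a minimal sketch. The abstract does not say how bias-conflicting samples are detected or which weighting formula is used, so everything concrete here is an assumption: the function names, the confidence `threshold`, the rule "keep a sample if a preliminary biased model is confident about its label", and the relative-difficulty weight `CE_B / (CE_B + CE_D)`, which is one form used in reweighting-based debiasing. The probabilities are toy values, not results from the paper.

```python
import math

EPS = 1e-12  # numerical guard against log(0) and division by zero


def cross_entropy(probs, label):
    """Cross-entropy loss of one sample given predicted class probabilities."""
    return -math.log(max(probs[label], EPS))


def select_bias_amplified(probs_b, labels, threshold=0.5):
    """Return indices of samples kept for training the biased model f_B.

    Hypothetical criterion (not taken from the abstract): keep a sample
    only if a preliminary biased model assigns its ground-truth class a
    probability of at least `threshold`. Samples below the threshold are
    treated as likely bias-conflicting and are dropped.
    """
    return [i for i, (p, y) in enumerate(zip(probs_b, labels)) if p[y] >= threshold]


def relative_difficulty(probs_b, probs_d, labels):
    """Per-sample weight for f_D, assumed to be CE_B / (CE_B + CE_D).

    The weight approaches 1 when f_B fails on a sample, so f_D
    concentrates on the samples that f_B does not learn.
    """
    weights = []
    for pb, pd, y in zip(probs_b, probs_d, labels):
        ce_b = cross_entropy(pb, y)
        ce_d = cross_entropy(pd, y)
        weights.append(ce_b / (ce_b + ce_d + EPS))
    return weights


# Toy setup: two classes, four samples. The last sample stands in for a
# bias-conflicting one, since the biased model misclassifies it confidently.
labels = [0, 0, 1, 1]
probs_b = [[0.9, 0.1], [0.8, 0.2], [0.1, 0.9], [0.95, 0.05]]
probs_d = [[0.6, 0.4], [0.6, 0.4], [0.4, 0.6], [0.5, 0.5]]

kept = select_bias_amplified(probs_b, labels)
weights = relative_difficulty(probs_b, probs_d, labels)

print("indices kept for f_B:", kept)
print("weights for f_D:", [round(w, 3) for w in weights])
```

In this toy run the fourth sample is excluded from the training set of $f_B$ and receives the largest weight for $f_D$, which mirrors the division of labor the abstract describes. A real pipeline would obtain the probabilities from trained networks and would need its own rule for deciding which samples count as bias-conflicting.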

