Deep learning models tend not to be out-of-distribution (OOD) robust, primarily because they rely on spurious features to solve the task. Counterfactual data augmentations provide a general way of (approximately) achieving representations that are counterfactually invariant to spurious features, a requirement for OOD robustness. In this work, we show that counterfactual data augmentations may not achieve the desired counterfactual invariance if the augmentation is performed by a context-guessing machine, an abstract machine that guesses the most likely context of a given input. We theoretically analyze the invariance imposed by such counterfactual data augmentations and describe an exemplar NLP task where counterfactual data augmentation by a context-guessing machine does not lead to robust OOD classifiers.
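To make the setting concrete, the following is a minimal toy sketch (not taken from the paper) of counterfactual data augmentation driven by a context-guessing machine: a spurious context token co-occurs with the label, and the augmentation replaces the context that the machine guesses to be most likely for a given input. All names here (CONTEXTS, guess_context, augment) are hypothetical and chosen purely for illustration.

```python
import random

CONTEXTS = ["<books>", "<movies>"]      # spurious context markers
LABELS = {"great": 1, "terrible": 0}    # causal sentiment words

def make_example(label_word, context):
    return f"{context} this product is {label_word}", LABELS[label_word]

def guess_context(text):
    # A "context-guessing machine": returns the single most likely context
    # for the input, rather than considering all plausible contexts.
    return "<books>" if "<books>" in text else "<movies>"

def augment(text, label):
    # Counterfactual augmentation driven by the guessed context:
    # swap the guessed context for a different one, keeping the label fixed.
    guessed = guess_context(text)
    alternative = random.choice([c for c in CONTEXTS if c != guessed])
    return text.replace(guessed, alternative), label

# Usage: build a biased training set, then add counterfactual copies of each example.
data = [make_example("great", "<books>"), make_example("terrible", "<movies>")]
augmented = data + [augment(t, y) for t, y in data]
for t, y in augmented:
    print(y, t)
```

Because the augmentation only ever edits the single context the machine guesses to be most likely, inputs whose true context is ambiguous or mis-guessed are not counterfactually paired, which is the kind of gap the abstract's invariance argument is about.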