Accuracy on the Curve: On the Nonlinear Correlation of ML Performance Between Data Subpopulations

May 31, 2023

Weixin Liang, Yining Mao, Yongchan Kwon, Xinyu Yang, James Zou

From arXiv. Accepted to the main conference of ICML 2023.

Understanding the performance of machine learning (ML) models across diverse data distributions is critically important for reliable applications. Despite recent empirical studies positing a near-perfect linear correlation between in-distribution (ID) and out-of-distribution (OOD) accuracies, we empirically demonstrate that this correlation is more nuanced under subpopulation shifts. Through rigorous experimentation and analysis across a variety of datasets, models, and training epochs, we demonstrate that OOD performance often has a nonlinear correlation with ID performance in subpopulation shifts. Our findings, which contrast with previous studies that have posited a linear correlation in model performance during distribution shifts, reveal a "moon shape" correlation (parabolic uptrend curve) between the test performance on the majority subpopulation and the minority subpopulation. This non-trivial nonlinear correlation holds across model architectures, hyperparameters, training durations, and the imbalance between subpopulations. Furthermore, we find that the nonlinearity of this "moon shape" is causally influenced by the degree of spurious correlations in the training data. Our controlled experiments show that stronger spurious correlation in the training data produces a more strongly nonlinear performance correlation. We provide complementary experimental and theoretical analyses for this phenomenon, and discuss its implications for ML reliability and fairness. Our work highlights the importance of understanding the nonlinear effects of model improvement on performance in different subpopulations, and has the potential to inform the development of more equitable and responsible machine learning models.
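One simple way to check for the "moon shape" described in the abstract is to compare a straight-line fit with a quadratic fit of minority-subpopulation accuracy against majority-subpopulation accuracy. The sketch below is illustrative only and is not the authors' analysis code: the function names `r_squared` and `compare_fits` are hypothetical, the quadratic fit is an assumed stand-in for the "parabolic uptrend curve", and the accuracy values are synthetic, generated purely for demonstration.

```python
import numpy as np


def r_squared(y, y_hat):
    """Coefficient of determination for predictions y_hat against targets y."""
    ss_res = np.sum((y - y_hat) ** 2)
    ss_tot = np.sum((y - np.mean(y)) ** 2)
    return 1.0 - ss_res / ss_tot


def compare_fits(majority_acc, minority_acc):
    """Fit linear and quadratic curves and report how well each explains the data.

    Each (majority_acc[i], minority_acc[i]) pair is assumed to come from one
    evaluated model (e.g. one architecture, hyperparameter setting, or epoch).
    """
    x = np.asarray(majority_acc, dtype=float)
    y = np.asarray(minority_acc, dtype=float)
    linear = np.polyfit(x, y, deg=1)
    quadratic = np.polyfit(x, y, deg=2)
    return {
        "r2_linear": r_squared(y, np.polyval(linear, x)),
        "r2_quadratic": r_squared(y, np.polyval(quadratic, x)),
        # Leading coefficient of the quadratic; positive means an upward-bending curve.
        "curvature": quadratic[0],
    }


# Synthetic, made-up accuracies: minority accuracy rises slowly at first,
# then faster as majority accuracy approaches its ceiling.
rng = np.random.default_rng(0)
majority = np.linspace(0.5, 0.99, 40)
minority = 0.2 + 2.0 * (majority - 0.5) ** 2 + rng.normal(0.0, 0.01, size=majority.size)

result = compare_fits(majority, minority)
print(result)
```

A clearly higher quadratic R² together with a positive curvature term would be consistent with an upward-bending relationship, whereas near-identical R² values would suggest a line describes the data about as well. This is a rough diagnostic under the stated assumptions, not a substitute for the paper's experiments.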
