Importance weighting is a classic technique for handling distribution shifts. However, prior work has presented strong empirical and theoretical evidence demonstrating that importance weights can have little to no effect on overparameterized neural networks. Is importance weighting truly incompatible with the training of overparameterized neural networks? Our paper answers this in the negative. We show that importance weighting fails not because of overparameterization, but rather as a result of using exponentially-tailed losses such as the logistic or cross-entropy loss. As a remedy, we show that polynomially-tailed losses restore the effects of importance reweighting in correcting distribution shift in overparameterized models. We characterize the behavior of gradient descent on importance-weighted polynomially-tailed losses with overparameterized linear models, and theoretically demonstrate the advantage of polynomially-tailed losses in a label shift setting. Surprisingly, our theory shows that using weights obtained by exponentiating the classical unbiased importance weights can improve performance. Finally, we demonstrate the practical value of our analysis with neural network experiments on a subpopulation shift dataset and a label shift dataset. When reweighted, our loss function can outperform reweighted cross-entropy by as much as 9% in test accuracy. It also yields test accuracies comparable to, or even exceeding, well-tuned state-of-the-art methods for correcting distribution shifts.
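To make the tail distinction concrete, here is a minimal illustrative contrast as a function of the margin $z$; the polynomial form below is an assumed example for exposition, not necessarily the exact loss family analyzed in the paper:
\[
\ell_{\mathrm{exp}}(z) \;=\; \log\!\left(1 + e^{-z}\right) \;\approx\; e^{-z} \quad (z \to \infty),
\qquad
\ell_{\mathrm{poly}}(z) \;=\; \frac{1}{(1+z)^{\alpha}} \quad (z \ge 0,\ \alpha > 0).
\]
Under an exponential tail, per-example gradients vanish at a rate that eventually overwhelms any fixed importance weight as margins grow, so the weights become asymptotically irrelevant to the solution found by gradient descent; a polynomial tail decays slowly enough that the weights continue to influence the learned classifier.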