Empirical risk minimization (ERM) is known in practice to be non-robust to distributional shift, where the training and test distributions differ. A suite of approaches, such as importance weighting and variants of distributionally robust optimization (DRO), have been proposed to solve this problem. But a line of recent work has empirically shown that these approaches do not significantly improve over ERM in real applications with distribution shift. The goal of this work is to obtain a comprehensive theoretical understanding of this intriguing phenomenon. We first posit the class of Generalized Reweighting (GRW) algorithms as a broad category of approaches that iteratively update model parameters based on iterative reweighting of the training samples. We show that when overparameterized models are trained under GRW, the resulting models are close to those obtained by ERM. We also show that adding a small regularization which does not greatly affect the empirical training accuracy does not help. Together, our results show that a broad category of what we term GRW approaches is not able to achieve distributionally robust generalization. Our work thus has the following sobering takeaway: to make progress towards distributionally robust generalization, we either have to develop non-GRW approaches, or perhaps devise novel classification/regression loss functions that are adapted to the class of GRW approaches.
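To make the GRW template concrete, the following is a minimal illustrative sketch of one instance of such an algorithm: alternating an exponentiated-gradient reweighting step that upweights high-loss samples (as in sample-level DRO variants) with a gradient step on the weighted empirical risk. The linear model, squared loss, and all hyperparameter names here are illustrative assumptions, not the paper's specific setup.

```python
import numpy as np

# Hypothetical GRW-style loop: linear model, squared loss (illustrative only).
rng = np.random.default_rng(0)
n, d = 20, 5
X = rng.normal(size=(n, d))
y = rng.normal(size=n)

theta = np.zeros(d)
q = np.full(n, 1.0 / n)   # per-sample weights, initialized uniform
lr, eta = 0.1, 0.5        # model step size, reweighting rate (assumed values)

for _ in range(100):
    residuals = X @ theta - y
    losses = residuals ** 2
    # Reweighting step: exponentially upweight high-loss samples, then
    # renormalize so the weights stay on the probability simplex.
    q *= np.exp(eta * losses)
    q /= q.sum()
    # Model step: gradient descent on the reweighted empirical risk
    # sum_i q_i * (x_i^T theta - y_i)^2.
    grad = 2 * X.T @ (q * residuals)
    theta -= lr * grad

# Weights remain a valid distribution over the training samples.
assert np.all(q >= 0) and np.isclose(q.sum(), 1.0)
```

ERM is recovered as the special case where the weights are held fixed at the uniform distribution; importance weighting fixes them at (estimated) density ratios, and DRO-style methods adapt them adversarially as above.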