Early stopping is a simple and widely used method to prevent over-training in neural networks. We develop theoretical results that reveal the relationship between the optimal early stopping time and the model dimension, as well as the sample size of the dataset, for certain linear models. Our results demonstrate two very different behaviors depending on whether the model dimension exceeds the number of features or falls below it. While most previous work on linear models focuses on the latter setting, we observe that in common deep learning tasks the model dimension often exceeds the number of features arising from the data, and we propose a model to study this setting. We demonstrate experimentally that our theoretical results on the optimal early stopping time correspond to the training process of deep neural networks.
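For concreteness, the sketch below shows how early stopping is commonly implemented for gradient descent on a generic linear least-squares model: training halts once a held-out validation loss stops improving, and the step at which this happens plays the role of the early stopping time studied here. This is a minimal illustration under assumed settings; the problem instance, names (`X_train`, `patience`), learning rate, and stopping rule are illustrative and not taken from the paper.

```python
import numpy as np

# Minimal sketch of early stopping for gradient descent on a linear model.
# The data-generating process below is a generic noisy least-squares setup,
# not the specific model analyzed in the paper.

rng = np.random.default_rng(0)
n, d = 100, 50                          # sample size n, model dimension d
w_true = rng.normal(size=d)
X_train = rng.normal(size=(n, d))
y_train = X_train @ w_true + 0.5 * rng.normal(size=n)
X_val = rng.normal(size=(n, d))
y_val = X_val @ w_true + 0.5 * rng.normal(size=n)

w = np.zeros(d)                         # start from the zero initialization
lr = 1e-2                               # illustrative learning rate
best_val, best_step, patience, bad_steps = np.inf, 0, 20, 0

for step in range(10_000):
    # gradient of the training loss (1/2n) * ||Xw - y||^2
    grad = X_train.T @ (X_train @ w - y_train) / n
    w -= lr * grad
    val_loss = np.mean((X_val @ w - y_val) ** 2)
    if val_loss < best_val:
        best_val, best_step, bad_steps = val_loss, step, 0
    else:
        bad_steps += 1
        if bad_steps >= patience:       # stop once validation loss plateaus
            break

print(f"stopped at step {step}; best validation loss {best_val:.4f} at step {best_step}")
```

In this sketch, `best_step` is the empirical analogue of the optimal early stopping time: the number of gradient steps after which further training only fits noise and degrades held-out performance.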