There has been growing interest in the generalization performance of large multilayer neural networks that can be trained to achieve zero training error while generalizing well on test data. This regime is known as 'second descent', and it appears to contradict the conventional view that optimal model complexity should reflect an optimal balance between underfitting and overfitting, i.e., the bias-variance trade-off. This paper presents a VC-theoretical analysis of double descent and shows that it can be fully explained by classical VC generalization bounds. We illustrate an application of analytic VC bounds to modeling double descent in classification problems, using empirical results for several learning methods, such as SVM, Least Squares, and Multilayer Perceptron classifiers. In addition, we discuss several possible reasons for the misinterpretation of VC-theoretical results in the machine learning community.
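For reference, a commonly cited simplified form of the classical VC generalization bound for classification is sketched below (this is the standard Vapnik bound for indicator loss; the exact analytic bound used in the paper may differ). With probability at least $1-\eta$, the expected risk $R(\alpha)$ of a classifier with empirical risk $R_{\mathrm{emp}}(\alpha)$, estimated from $n$ training samples using a hypothesis class of VC dimension $h$, satisfies

\[
R(\alpha) \;\le\; R_{\mathrm{emp}}(\alpha) + \sqrt{\frac{h\bigl(\ln(2n/h) + 1\bigr) - \ln(\eta/4)}{n}}.
\]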