Generalization error bounds for deep neural networks trained by stochastic gradient descent (SGD) are derived by combining a dynamical control of an appropriate parameter norm with a Rademacher complexity estimate based on parameter norms. The bounds depend explicitly on the loss along the training trajectory and apply to a wide range of network architectures, including multilayer perceptrons (MLPs) and convolutional neural networks (CNNs). Compared with other algorithm-dependent generalization estimates, such as uniform-stability-based bounds, our bounds do not require $L$-smoothness of the nonconvex loss function and apply directly to SGD rather than to stochastic gradient Langevin dynamics (SGLD). Numerical results show that our bounds are non-vacuous and robust to changes in the optimizer and network hyperparameters.
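For context, a standard norm-based Rademacher complexity bound (background only, not the specific bound derived here) states that, for a loss $\ell$ taking values in $[0,1]$ and an i.i.d. sample of size $n$, with probability at least $1-\delta$, every $f$ in the hypothesis class $\mathcal{F}$ satisfies
\[
\mathbb{E}\,\ell\bigl(f(x),y\bigr) \;\le\; \frac{1}{n}\sum_{i=1}^{n} \ell\bigl(f(x_i),y_i\bigr) \;+\; 2\,\mathfrak{R}_n(\ell\circ\mathcal{F}) \;+\; \sqrt{\frac{\log(1/\delta)}{2n}},
\]
where $\mathfrak{R}_n(\ell\circ\mathcal{F})$ denotes the Rademacher complexity of the loss class. For MLPs and CNNs this complexity can in turn be bounded in terms of layerwise parameter norms, which is the type of estimate combined with the trajectory-wise norm control described above.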