We study the excess capacity of deep networks in the context of supervised classification. That is, given a capacity measure of the underlying hypothesis class (in our case, empirical Rademacher complexity), to what extent can we (a priori) constrain this class while retaining an empirical error on a par with the unconstrained regime? To assess excess capacity in modern architectures (such as residual networks), we extend and unify prior Rademacher complexity bounds to accommodate function composition and addition, as well as the structure of convolutions. The capacity-driving terms in our bounds are the Lipschitz constants of the layers and a (2,1)-group-norm distance of the convolution weights to their initializations. Experiments on benchmark datasets of varying task difficulty indicate that (1) there is a substantial amount of excess capacity per task, and (2) capacity can be kept at a surprisingly similar level across tasks. Overall, this suggests a notion of compressibility with respect to weight norms, complementary to classic compression via weight pruning. Source code is available at https://github.com/rkwitt/excess_capacity.
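To make the capacity-driving norm term concrete, the following is a minimal sketch of a (2,1)-group-norm distance to initialization for a convolution weight tensor. It assumes one common grouping convention (each output-channel filter is one group, flattened to a row); the paper's exact grouping and the function/variable names here are illustrative, not taken from the released code.

```python
import numpy as np

def group_norm_21(w):
    """(2,1)-group norm of a conv weight tensor.

    Each output-channel filter is flattened to a row; the result is the
    sum of the per-row L2 norms (an assumed grouping convention).
    """
    rows = w.reshape(w.shape[0], -1)           # (out_channels, in_channels*k*k)
    return np.linalg.norm(rows, axis=1).sum()  # sum of row-wise L2 norms

rng = np.random.default_rng(0)
# Hypothetical conv weights: 8 output channels, 3 input channels, 3x3 kernels.
w_init = rng.standard_normal((8, 3, 3, 3))
w_trained = w_init + 0.01 * rng.standard_normal(w_init.shape)

# The bound is driven by the distance to initialization, not the raw norm:
dist_to_init = group_norm_21(w_trained - w_init)
```

Constraining `dist_to_init` (rather than the norm of `w_trained` itself) is what allows the hypothesis class to be shrunk a priori around the random initialization while tracking the unconstrained empirical error.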