In this paper, we study the compression of a target two-layer neural network with N nodes into a compressed network with M < N nodes. More precisely, we consider the setting in which the weights of the target network are i.i.d. sub-Gaussian, and we minimize the population L2 loss between the outputs of the target and of the compressed network, under the assumption of Gaussian inputs. By using tools from high-dimensional probability, we show that this non-convex problem can be simplified when the target network is sufficiently over-parameterized, and we provide the error rate of this approximation as a function of the input dimension and N. For a ReLU activation function, we conjecture that the optimum of the simplified optimization problem is achieved by placing the weights of the compressed network on an Equiangular Tight Frame (ETF), with the scaling of the weights and the orientation of the ETF depending on the parameters of the target network. Numerical evidence is provided to support this conjecture.
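To make the setting concrete, a plausible formalization of the compression objective is sketched below; the normalization and the presence of second-layer weights in the target network are illustrative assumptions, not details taken from the abstract.

\[
\min_{\{a_i,\, v_i\}_{i=1}^{M}} \;\; \mathbb{E}_{x \sim \mathcal{N}(0, I_d)}\!\left[\left(\sum_{j=1}^{N} b_j\,\sigma(\langle w_j, x\rangle) \;-\; \sum_{i=1}^{M} a_i\,\sigma(\langle v_i, x\rangle)\right)^{2}\right],
\]

where \(\sigma\) is the activation function (e.g., ReLU), \(\{w_j, b_j\}_{j=1}^{N}\) are the i.i.d. sub-Gaussian parameters of the target network, and \(\{v_i, a_i\}_{i=1}^{M}\) are the parameters of the compressed network being optimized.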