Heavy-tailed distributions have been studied in statistics, random matrix theory, physics, and econometrics as models of correlated systems, among other domains. Furthermore, heavy-tailed eigenvalue distributions of the covariance matrices of neural network weight matrices have been shown empirically to correlate with test set accuracy in several works (e.g. arXiv:1901.08276), but a formal relationship between heavy-tailed parameter distributions and generalization bounds had yet to be demonstrated. In this work, the compression framework of arXiv:1802.05296 is utilized to show that matrices with heavy-tailed matrix elements can be compressed, resulting in networks with sparse weight matrices. Since the parameter count is reduced to the number of non-zero elements of the sparse matrices, the compression framework allows us to bound the generalization gap of the resulting compressed network with a non-vacuous generalization bound. Finally, the action of these matrices on a vector is discussed, and its relation to compression and resilient classification is analyzed.
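The compression step can be illustrated with a minimal numerical sketch: when matrix elements are heavy-tail distributed, a few large-magnitude entries dominate the matrix's action on a vector, so zeroing the small entries yields a sparse matrix that approximately preserves that action. The sketch below assumes a simple hard-threshold truncation and symmetric Pareto-distributed entries purely for illustration; the tail exponent, cutoff, and truncation rule are assumptions and need not match the paper's exact compression scheme.

```python
import numpy as np

# Illustrative sketch (assumptions: Pareto-tailed entries, hard-threshold
# compression; not necessarily the paper's exact scheme).
rng = np.random.default_rng(0)
n = 500

# Heavy-tailed entries: symmetric power-law (Pareto/Lomax) draws.
alpha = 1.5  # tail exponent; smaller alpha => heavier tail
signs = rng.choice([-1.0, 1.0], size=(n, n))
W = signs * rng.pareto(alpha, size=(n, n))

# Compress: keep only entries above a magnitude cutoff (hypothetical rule).
cutoff = 5.0
W_sparse = np.where(np.abs(W) >= cutoff, W, 0.0)

# The effective parameter count is now the number of non-zero elements,
# while the action on a vector is approximately preserved.
sparsity = 1.0 - np.count_nonzero(W_sparse) / W.size
x = rng.standard_normal(n)
rel_err = np.linalg.norm(W @ x - W_sparse @ x) / np.linalg.norm(W @ x)

print(f"fraction of entries zeroed: {sparsity:.3f}")
print(f"relative error of W_sparse @ x vs. W @ x: {rel_err:.3f}")
```

Under these assumptions, most entries are zeroed while the relative error of the compressed matrix-vector product stays small, which is the mechanism by which the reduced non-zero parameter count feeds into the compression-based generalization bound.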