随机随机初始化神经网络 (Pruning Randomly Initialized Neural Networks with Iterative Randomization) - 专知论文

会员服务 ·

0

Weight · 剪枝 · Neural Networks · Networking · Performer ·

2021 年 6 月 17 日

Pruning Randomly Initialized Neural Networks with Iterative Randomization

翻译：随机随机初始化神经网络

Daiki Chijiwa,Shin'ya Yamaguchi,Yasutoshi Ida,Kenji Umakoshi,Tomohiro Inoue

from arxiv, Code will be available at https://github.com/dchiji-ntt/iterand

Pruning the weights of randomly initialized neural networks plays an important role in the context of lottery ticket hypothesis. Ramanujan et al. (2020) empirically showed that only pruning the weights can achieve remarkable performance instead of optimizing the weight values. However, to achieve the same level of performance as the weight optimization, the pruning approach requires more parameters in the networks before pruning and thus more memory space. To overcome this parameter inefficiency, we introduce a novel framework to prune randomly initialized neural networks with iteratively randomizing weight values (IteRand). Theoretically, we prove an approximation theorem in our framework, which indicates that the randomizing operations are provably effective to reduce the required number of the parameters. We also empirically demonstrate the parameter efficiency in multiple experiments on CIFAR-10 and ImageNet.

翻译：随机初始神经网络的权重在彩票假设中起着重要作用。 Ramanujan等人(2020年)从经验上表明,只有调整重量才能取得显著的性能,而不是优化重量值。然而,为了达到与重量优化相同的性能水平,修剪方法要求在裁剪之前在网络中增加参数,从而增加记忆空间。为了克服这一效率低下的参数,我们引入了一个新框架,利用迭接随机权重值(IteRand)随机启动神经网络。理论上,我们证明了我们框架中的近似理论,这表明随机操作对于减少所需参数数量非常有效。我们还在对CIFAR-10和图像网络的多次实验中以实验方式展示参数效率。

1

相关内容

Weight

最新《对抗机器学习》报告，EPFL-Volkan教授讲解AML中的优化问题

最新《对抗机器学习》报告，EPFL-Volkan教授讲解AML中的优化问题

专知会员服务

36+阅读 · 2021年1月14日

迁移学习简明教程，11页ppt

迁移学习简明教程，11页ppt

专知会员服务

108+阅读 · 2020年8月4日

神经网络序列数据建模，229页ppt，Modeling Sequential Data with Neural Nets

神经网络序列数据建模，229页ppt，Modeling Sequential Data with Neural Nets

专知会员服务

67+阅读 · 2020年7月25日

【KDD2020】自适应多通道图卷积神经网络

【KDD2020】自适应多通道图卷积神经网络

专知会员服务

121+阅读 · 2020年7月9日

【Google】平滑对抗训练，Smooth Adversarial Training

【Google】平滑对抗训练，Smooth Adversarial Training

专知会员服务

49+阅读 · 2020年7月4日

抢鲜看！13篇CVPR2020论文链接/开源代码/解读

抢鲜看！13篇CVPR2020论文链接/开源代码/解读

专知会员服务

50+阅读 · 2020年2月26日

【北京智源大会2019】神经网络的优化Optimization for Overparametrized Deep Neural Networks，北京大学 | 王立威

【北京智源大会2019】神经网络的优化Optimization for Overparametrized Deep Neural Networks，北京大学 | 王立威

专知会员服务

23+阅读 · 2019年11月21日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

【亚马逊网络服务总监Alexander J. Smola报告】深度学习注意力机制-Attention in Deep learning-附101页PPT

【亚马逊网络服务总监Alexander J. Smola报告】深度学习注意力机制-Attention in Deep learning-附101页PPT

专知会员服务

68+阅读 · 2019年6月11日

深度神经网络压缩和加速相关最全资源分享

深度神经网络压缩和加速相关最全资源分享

深度学习与NLP

3+阅读 · 2019年7月5日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

PRL导读-2018年120卷15期

PRL导读-2018年120卷15期

中科院物理所

4+阅读 · 2018年4月23日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

Capsule Networks解析

Capsule Networks解析

机器学习研究会

11+阅读 · 2017年11月12日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

Confidence Adaptive Regularization for Deep Learning with Noisy Labels

Arxiv

0+阅读 · 2021年8月18日

Deeper or Wider Networks of Point Clouds with Self-attention?

Arxiv

0+阅读 · 2021年8月14日

Adaptive Transfer Learning on Graph Neural Networks

Arxiv

14+阅读 · 2021年7月20日

The Implicit Bias for Adaptive Optimization Algorithms on Homogeneous Neural Networks

Arxiv

4+阅读 · 2021年7月5日

ResMLP: Feedforward networks for image classification with data-efficient training

Arxiv

12+阅读 · 2021年5月7日

Theoretical Analysis of Self-Training with Deep Networks on Unlabeled Data

Arxiv

9+阅读 · 2021年2月8日

Go Wide, Then Narrow: Efficient Training of Deep Thin Networks

Arxiv

15+阅读 · 2020年7月1日

ML-Net: multi-label classification of biomedical texts with deep neural networks

ML-Net: multi-label classification of biomedical texts with deep neural networks

Arxiv

7+阅读 · 2018年11月15日

Bayesian Convolutional Neural Networks

Arxiv

19+阅读 · 2018年6月27日

Pointer Networks

Arxiv

4+阅读 · 2017年1月2日

VIP会员

文章信息

相关主题

Neural Networks

相关VIP内容

最新《对抗机器学习》报告，EPFL-Volkan教授讲解AML中的优化问题

最新《对抗机器学习》报告，EPFL-Volkan教授讲解AML中的优化问题

专知会员服务

36+阅读 · 2021年1月14日

迁移学习简明教程，11页ppt

迁移学习简明教程，11页ppt

专知会员服务

108+阅读 · 2020年8月4日

神经网络序列数据建模，229页ppt，Modeling Sequential Data with Neural Nets

神经网络序列数据建模，229页ppt，Modeling Sequential Data with Neural Nets

专知会员服务

67+阅读 · 2020年7月25日

【KDD2020】自适应多通道图卷积神经网络

【KDD2020】自适应多通道图卷积神经网络

专知会员服务

121+阅读 · 2020年7月9日

【Google】平滑对抗训练，Smooth Adversarial Training

【Google】平滑对抗训练，Smooth Adversarial Training

专知会员服务

49+阅读 · 2020年7月4日

抢鲜看！13篇CVPR2020论文链接/开源代码/解读

抢鲜看！13篇CVPR2020论文链接/开源代码/解读

专知会员服务

50+阅读 · 2020年2月26日

【北京智源大会2019】神经网络的优化Optimization for Overparametrized Deep Neural Networks，北京大学 | 王立威

【北京智源大会2019】神经网络的优化Optimization for Overparametrized Deep Neural Networks，北京大学 | 王立威

专知会员服务

23+阅读 · 2019年11月21日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

【亚马逊网络服务总监Alexander J. Smola报告】深度学习注意力机制-Attention in Deep learning-附101页PPT

【亚马逊网络服务总监Alexander J. Smola报告】深度学习注意力机制-Attention in Deep learning-附101页PPT

专知会员服务

68+阅读 · 2019年6月11日

热门VIP内容

开通专知VIP会员享更多权益服务

《乌克兰无人机产业：志愿者与政策在构建新兴无人机产业中的协同作用》最新报告

《人工智能辅助决策中的数据可视化：系统性综述》

人工智能驱动弹药制造现代化：美国陆军转型之路

《敏捷作战部署中枢纽-辐条基地选址优化研究》80页

相关资讯

深度神经网络压缩和加速相关最全资源分享

深度神经网络压缩和加速相关最全资源分享

深度学习与NLP

3+阅读 · 2019年7月5日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

PRL导读-2018年120卷15期

PRL导读-2018年120卷15期

中科院物理所

4+阅读 · 2018年4月23日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

Capsule Networks解析

Capsule Networks解析

机器学习研究会

11+阅读 · 2017年11月12日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

相关论文

Confidence Adaptive Regularization for Deep Learning with Noisy Labels

Arxiv

0+阅读 · 2021年8月18日

Deeper or Wider Networks of Point Clouds with Self-attention?

Arxiv

0+阅读 · 2021年8月14日

Adaptive Transfer Learning on Graph Neural Networks

Arxiv

14+阅读 · 2021年7月20日

The Implicit Bias for Adaptive Optimization Algorithms on Homogeneous Neural Networks

Arxiv

4+阅读 · 2021年7月5日

ResMLP: Feedforward networks for image classification with data-efficient training

Arxiv

12+阅读 · 2021年5月7日

Theoretical Analysis of Self-Training with Deep Networks on Unlabeled Data

Arxiv

9+阅读 · 2021年2月8日

Go Wide, Then Narrow: Efficient Training of Deep Thin Networks

Arxiv

15+阅读 · 2020年7月1日

ML-Net: multi-label classification of biomedical texts with deep neural networks

ML-Net: multi-label classification of biomedical texts with deep neural networks

Arxiv

7+阅读 · 2018年11月15日

Bayesian Convolutional Neural Networks

Arxiv

19+阅读 · 2018年6月27日

Pointer Networks

Arxiv

4+阅读 · 2017年1月2日

微信扫码咨询专知VIP会员