One of the most fundamental design choices in neural networks is layer width: it affects how much a network can learn and determines the complexity of the learned solution. This latter property is often exploited when introducing information bottlenecks, forcing a network to learn compressed representations. However, such an architectural decision is typically immutable once training begins; switching to a more compressed architecture requires retraining. In this paper we present a new layer design, called Triangular Dropout, which does not have this limitation. After training, the layer can be arbitrarily reduced in width to exchange performance for narrowness. We demonstrate the construction and potential use cases of such a mechanism in three areas. First, we describe the formulation of Triangular Dropout in autoencoders, creating models with selectable compression after training. Second, we add Triangular Dropout to VGG19 on ImageNet, creating a powerful network whose parameter count can be significantly reduced without retraining. Lastly, we explore the application of Triangular Dropout to reinforcement learning (RL) policies on selected control problems.
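To make the core idea concrete, the sketch below shows one plausible way such a width-reducible layer could be realized; it is an illustrative assumption, not the paper's exact formulation. The class name TriangularDropoutSketch and the uniform sampling of the cutoff width are hypothetical choices: during training, each example sees only a random prefix of the layer's units, so every prefix is trained to function on its own, and after training the attribute `width` can be lowered to shrink the layer without retraining.

```python
import torch
import torch.nn as nn

class TriangularDropoutSketch(nn.Module):
    """Illustrative width-reducible activation mask (assumed formulation).

    During training, a cutoff w is sampled per example and all units beyond
    the first w are zeroed, so each prefix of the layer learns to work alone.
    At evaluation time, `width` can be set to any value <= features to trade
    performance for narrowness.
    """

    def __init__(self, features: int):
        super().__init__()
        self.features = features
        self.width = features  # evaluation-time width, adjustable after training

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        if self.training:
            # Sample a cutoff uniformly from {1, ..., features} for each example.
            w = torch.randint(1, self.features + 1, (x.shape[0], 1), device=x.device)
        else:
            # Use the chosen post-training width for every example.
            w = torch.full((x.shape[0], 1), self.width, device=x.device)
        idx = torch.arange(self.features, device=x.device).unsqueeze(0)
        mask = (idx < w).to(x.dtype)  # 1 for kept units, 0 for truncated ones
        return x * mask
```

Used after the bottleneck layer of an autoencoder, for example, this would let the same trained model be evaluated at any bottleneck width, matching the selectable-compression behavior described above.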