在人工神经网络的粗略捷径地形图上 (On a Sparse Shortcut Topology of Artificial Neural Networks) - 专知论文

会员服务 ·

0

通用近似器 · 近似 · Networking · 稀疏 · 人工神经网络 ·

2021 年 11 月 11 日

On a Sparse Shortcut Topology of Artificial Neural Networks

翻译：在人工神经网络的粗略捷径地形图上

Fenglei Fan,Dayang Wang,Hengtao Guo,Qikui Zhu,Pingkun Yan,Ge Wang,Hengyong Yu

In established network architectures, shortcut connections are often used to take the outputs of earlier layers as additional inputs to later layers. Despite the extraordinary effectiveness of shortcuts, there remain open questions on the mechanism and characteristics. For example, why are shortcuts powerful? Why do shortcuts generalize well? In this paper, we investigate the expressivity and generalizability of a novel sparse shortcut topology. First, we demonstrate that this topology can empower a one-neuron-wide deep network to approximate any univariate continuous function. Then, we present a novel width-bounded universal approximator in contrast to depth-bounded universal approximators and extend the approximation result to a family of equally competent networks. Furthermore, with generalization bound theory, we show that the proposed shortcut topology enjoys excellent generalizability. Finally, we corroborate our theoretical analyses by comparing the proposed topology with popular architectures, including ResNet and DenseNet, on well-known benchmarks and perform a saliency map analysis to interpret the proposed topology. Our work helps enhance the understanding of the role of shortcuts and suggests further opportunities to innovate neural architectures.

翻译：在已有的网络结构中,捷径连接常常被用来将早期层的产出作为附加投入提供给后层。尽管捷径的超常效果,但机制及其特点仍有一些未解的问题。例如,为什么捷径是强大的?为什么捷径是全面的?为什么捷径是全面的?在本文中,我们调查了新颖的稀有捷径地形的表达性和可概括性。首先,我们证明,这一地形学可以使整个单一中子深层的网络能够接近任何单一的连续功能。然后,我们提出了一个新的宽度宽度通用近似器,与深度的通用近似器形成对照,并将近似结果推广到同样合格的网络的大家庭。此外,根据一般化约束理论,我们表明拟议的捷径表学具有极佳的可概括性。最后,我们通过将拟议的地形学与流行结构(包括ResNet和DenseNet)进行比较,根据众所周知的基准进行理论分析,并进行显著的地图分析,以解释拟议的地形学。我们的工作有助于增进对捷径的作用的理解,并提出创新神经结构的进一步机会。

0

相关内容

通用近似器

通用近似器

【因果基础】Causality Basics，36页ppt

专知会员服务

52+阅读 · 2021年8月8日

【图与几何深度学习，53页ppt】Graph and geometric deep learning

专知会员服务

90+阅读 · 2021年6月14日

【如何做研究】How to research ，22页ppt

【如何做研究】How to research ，22页ppt

专知会员服务

112+阅读 · 2021年4月17日

【重磅】2021年IEEE Fellow出炉！ 282位新晋升会士！七十多位华人当选！

专知会员服务

23+阅读 · 2020年11月25日

最新《自然语言处理迁移学习》综述论文，A Survey on Transfer Learning in Natural Language Processing

最新《自然语言处理迁移学习》综述论文，A Survey on Transfer Learning in Natural Language Processing

专知会员服务

139+阅读 · 2020年7月10日

深度神经网络中的快捷学习，Shortcut Learning in Deep Neural Networks

深度神经网络中的快捷学习，Shortcut Learning in Deep Neural Networks

专知会员服务

22+阅读 · 2020年4月21日

神经网络的拓扑结构，TOPOLOGY OF DEEP NEURAL NETWORKS

神经网络的拓扑结构，TOPOLOGY OF DEEP NEURAL NETWORKS

专知会员服务

35+阅读 · 2020年4月15日

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

10+阅读 · 2019年10月24日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

图机器学习 2.2-2.4 Properties of Networks, Random Graph

图机器学习 2.2-2.4 Properties of Networks, Random Graph

图与推荐

10+阅读 · 2020年3月28日

局部学习的特征选择：Local-Learning-Based Feature Selection

局部学习的特征选择：Local-Learning-Based Feature Selection

我爱读PAMI

14+阅读 · 2019年9月20日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Nature 一周论文导读 | 2019 年 2 月 21 日

Nature 一周论文导读 | 2019 年 2 月 21 日

科研圈

14+阅读 · 2019年3月3日

meta learning 17年：MAML SNAIL

meta learning 17年：MAML SNAIL

CreateAMind

11+阅读 · 2019年1月2日

CCF C类 | IJCNN 2019 Special Section : 信息论与深度学习

CCF C类 | IJCNN 2019 Special Section : 信息论与深度学习

Call4Papers

5+阅读 · 2018年12月7日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

Capsule Networks解析

Capsule Networks解析

机器学习研究会

11+阅读 · 2017年11月12日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

De Rham compatible Deep Neural Networks

Arxiv

0+阅读 · 2022年1月14日

Training Free Graph Neural Networks for Graph Matching

Arxiv

1+阅读 · 2022年1月14日

Pointwise Binary Classification with Pairwise Confidence Comparisons

Arxiv

0+阅读 · 2022年1月13日

Experimental Design Networks: A Paradigm for Serving Heterogeneous Learners under Networking Constraints

Arxiv

0+阅读 · 2022年1月13日

Relating Graph Neural Networks to Structural Causal Models

Arxiv

44+阅读 · 2021年9月9日

The Implicit Bias for Adaptive Optimization Algorithms on Homogeneous Neural Networks

Arxiv

4+阅读 · 2021年7月5日

Scaling Properties of Deep Residual Networks

Arxiv

13+阅读 · 2021年5月25日

Dynamic Neural Networks: A Survey

Arxiv

37+阅读 · 2021年2月10日

A Survey on Graph Neural Networks for Knowledge Graph Completion

Arxiv

6+阅读 · 2020年7月24日

GraLSP: Graph Neural Networks with Local Structural Patterns

GraLSP: Graph Neural Networks with Local Structural Patterns

Arxiv

4+阅读 · 2019年11月18日

VIP会员

文章信息

相关主题

通用近似器

人工神经网络

相关VIP内容

【因果基础】Causality Basics，36页ppt

专知会员服务

52+阅读 · 2021年8月8日

【图与几何深度学习，53页ppt】Graph and geometric deep learning

专知会员服务

90+阅读 · 2021年6月14日

【如何做研究】How to research ，22页ppt

【如何做研究】How to research ，22页ppt

专知会员服务

112+阅读 · 2021年4月17日

【重磅】2021年IEEE Fellow出炉！ 282位新晋升会士！七十多位华人当选！

专知会员服务

23+阅读 · 2020年11月25日

最新《自然语言处理迁移学习》综述论文，A Survey on Transfer Learning in Natural Language Processing

最新《自然语言处理迁移学习》综述论文，A Survey on Transfer Learning in Natural Language Processing

专知会员服务

139+阅读 · 2020年7月10日

深度神经网络中的快捷学习，Shortcut Learning in Deep Neural Networks

深度神经网络中的快捷学习，Shortcut Learning in Deep Neural Networks

专知会员服务

22+阅读 · 2020年4月21日

神经网络的拓扑结构，TOPOLOGY OF DEEP NEURAL NETWORKS

神经网络的拓扑结构，TOPOLOGY OF DEEP NEURAL NETWORKS

专知会员服务

35+阅读 · 2020年4月15日

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

10+阅读 · 2019年10月24日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

《小型无人机系统侦测追踪技术：声学、计算机视觉与深度学习融合方案》最新98页

《"牧羊人网格"拦截策略：实现无人机集群可靠拦截的新范式》

光纤无人机：反无人机系统的重大挑战

《作战建模与仿真实证研究》

相关资讯

图机器学习 2.2-2.4 Properties of Networks, Random Graph

图机器学习 2.2-2.4 Properties of Networks, Random Graph

图与推荐

10+阅读 · 2020年3月28日

局部学习的特征选择：Local-Learning-Based Feature Selection

局部学习的特征选择：Local-Learning-Based Feature Selection

我爱读PAMI

14+阅读 · 2019年9月20日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Nature 一周论文导读 | 2019 年 2 月 21 日

Nature 一周论文导读 | 2019 年 2 月 21 日

科研圈

14+阅读 · 2019年3月3日

meta learning 17年：MAML SNAIL

meta learning 17年：MAML SNAIL

CreateAMind

11+阅读 · 2019年1月2日

CCF C类 | IJCNN 2019 Special Section : 信息论与深度学习

CCF C类 | IJCNN 2019 Special Section : 信息论与深度学习

Call4Papers

5+阅读 · 2018年12月7日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

Capsule Networks解析

Capsule Networks解析

机器学习研究会

11+阅读 · 2017年11月12日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

相关论文

De Rham compatible Deep Neural Networks

Arxiv

0+阅读 · 2022年1月14日

Training Free Graph Neural Networks for Graph Matching

Arxiv

1+阅读 · 2022年1月14日

Pointwise Binary Classification with Pairwise Confidence Comparisons

Arxiv

0+阅读 · 2022年1月13日

Experimental Design Networks: A Paradigm for Serving Heterogeneous Learners under Networking Constraints

Arxiv

0+阅读 · 2022年1月13日

Relating Graph Neural Networks to Structural Causal Models

Arxiv

44+阅读 · 2021年9月9日

The Implicit Bias for Adaptive Optimization Algorithms on Homogeneous Neural Networks

Arxiv

4+阅读 · 2021年7月5日

Scaling Properties of Deep Residual Networks

Arxiv

13+阅读 · 2021年5月25日

Dynamic Neural Networks: A Survey

Arxiv

37+阅读 · 2021年2月10日

A Survey on Graph Neural Networks for Knowledge Graph Completion

Arxiv

6+阅读 · 2020年7月24日

GraLSP: Graph Neural Networks with Local Structural Patterns

GraLSP: Graph Neural Networks with Local Structural Patterns

Arxiv

4+阅读 · 2019年11月18日

微信扫码咨询专知VIP会员