Deep Kronecker neural networks: A general framework for neural networks with adaptive activation functions - Zhuanzhi Papers


May 20, 2021


Ameya D. Jagtap, Yeonjong Shin, Kenji Kawaguchi, George Em Karniadakis

From arXiv, 26 pages, 13 figures

We propose a new type of neural network, Kronecker neural networks (KNNs), that forms a general framework for neural networks with adaptive activation functions. KNNs employ the Kronecker product, which provides an efficient way of constructing a very wide network while keeping the number of parameters low. Our theoretical analysis reveals that, under suitable conditions, KNNs induce a faster decay of the loss than feed-forward networks do. This is also verified empirically through a set of computational examples. Furthermore, under certain technical assumptions, we establish global convergence of gradient descent for KNNs. As a specific case, we propose the Rowdy activation function, which is designed to eliminate any saturation region by injecting sinusoidal fluctuations that include trainable parameters. The proposed Rowdy activation function can be employed in any neural network architecture, such as feed-forward, recurrent, and convolutional neural networks. The effectiveness of KNNs with Rowdy activation is demonstrated through various computational experiments, including function approximation using feed-forward neural networks, solution inference of partial differential equations using physics-informed neural networks, and standard deep learning benchmark problems using convolutional and fully-connected neural networks.
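
The two ideas in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact construction or the formula of the Rowdy activation, so everything below is an assumption for illustration: the helper names `kron` and `rowdy_like`, the parameters `coeffs` and `n`, and the assumed form a_1·tanh(x) + Σ_k a_k·n·sin(k·n·x) are hypothetical and may differ from the paper's definitions.

```python
import math


def kron(a, b):
    """Kronecker product of two matrices given as lists of rows."""
    return [
        [a_ij * b_kl for a_ij in a_row for b_kl in b_row]
        for a_row in a
        for b_row in b
    ]


# A 3x1 scaling vector combined with a 2x3 weight matrix yields a 6x3 matrix:
# the layer becomes three times wider, but only 3 + 6 = 9 numbers are free,
# versus 18 for an unconstrained 6x3 matrix.
scale = [[1.0], [0.5], [0.25]]
weights = [[1.0, 2.0, 3.0], [4.0, 5.0, 6.0]]
wide = kron(scale, weights)


def rowdy_like(x, coeffs, n=1.0, base=math.tanh):
    """Base activation plus sinusoidal terms (assumed form, not the paper's).

    coeffs[0] scales the base activation; coeffs[k] for k >= 1 scales
    n * sin(k * n * x). In a real network, coeffs would be trainable.
    """
    out = coeffs[0] * base(x)
    for k, a_k in enumerate(coeffs[1:], start=1):
        out += a_k * n * math.sin(k * n * x)
    return out


# tanh is nearly flat for large inputs; the sinusoidal term keeps the
# output (and hence the gradient) varying there.
plain = [1.0, 0.0]
rowdy = [1.0, 0.1]
flat_gap = abs(rowdy_like(10.0, plain) - rowdy_like(10.5, plain))
rowdy_gap = abs(rowdy_like(10.0, rowdy) - rowdy_like(10.5, rowdy))
```

With all sinusoidal coefficients set to zero, `rowdy_like` reduces to the base activation, which is one plausible way to initialize such a layer so that training starts from a standard network.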

