聚合网络与高斯进程之间的趋同率 (Rate of Convergence of Polynomial Networks to Gaussian Processes) - 专知论文

会员服务 ·

0

Networking · Processing（编程语言） · ReLU · 分解的 · Neural Networks ·

2021 年 11 月 4 日

Rate of Convergence of Polynomial Networks to Gaussian Processes

翻译：聚合网络与高斯进程之间的趋同率

from arxiv, 23 pages (13 for the main body)

We examine one-hidden-layer neural networks with random weights. It is well-known that in the limit of infinitely many neurons they simplify to Gaussian processes. For networks with a polynomial activation, we demonstrate that the rate of this convergence in 2-Wasserstein metric is $O(n^{-\frac{1}{2}})$, where $n$ is the number of hidden neurons. We suspect this rate is asymptotically sharp. We improve the known convergence rate for other activations, to power-law in $n$ for ReLU and inverse-square-root up to logarithmic factors for erf. We explore the interplay between spherical harmonics, Stein kernels and optimal transport in the non-isotropic setting.

翻译：我们用随机重量检查一个隐藏层神经网络。众所周知, 在无限多神经元的限度内, 它们会简化到高斯进程。对于具有多元激活作用的网络, 我们证明, 2- Wasserstein 公制的这种趋同速度是 $O (n)-\\\ frac{1 ⁇ 2 ⁇ ) $, 其中一美元是隐藏的神经元的数量。我们怀疑这个速度在瞬间是惊人的。我们提高了其他激活的已知趋同率, 将ReLU 的功率提高到 $( $) 和反平方根到 erf 的对数系数。我们探索了球调、 Stech 内核以及非粒子环境中的最佳运输方式之间的相互作用。

0

相关内容

Networking

Networking：IFIP International Conferences on Networking。 Explanation：国际网络会议。 Publisher：IFIP。 SIT： http://dblp.uni-trier.de/db/conf/networking/index.html

【硬核书】矩阵代数基础，248页pdf

【硬核书】矩阵代数基础，248页pdf

专知会员服务

87+阅读 · 2021年12月9日

【硬核书】树与网络上的概率，716页pdf

【硬核书】树与网络上的概率，716页pdf

专知会员服务

77+阅读 · 2021年12月8日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【经典书】应用随机微分方程，324页pdf，Applied Stochastic Differential Equations

【经典书】应用随机微分方程，324页pdf，Applied Stochastic Differential Equations

专知会员服务

57+阅读 · 2020年11月21日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

【清华大学】自动微分蒙特卡洛，理论与应用，Automatic Differentiable Monte Carlo: Theory and Application (附pdf）

专知会员服务

28+阅读 · 2019年11月23日

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

10+阅读 · 2019年10月24日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

《自然》（20190221出版）一周论文导读

《自然》（20190221出版）一周论文导读

科学网

6+阅读 · 2019年2月23日

已删除

将门创投

6+阅读 · 2018年12月3日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

可解释的CNN

可解释的CNN

CreateAMind

17+阅读 · 2017年10月5日

Adversarial Variational Bayes: Unifying VAE and GAN 代码

Adversarial Variational Bayes: Unifying VAE and GAN 代码

CreateAMind

7+阅读 · 2017年10月4日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

强化学习 cartpole_a3c

强化学习 cartpole_a3c

CreateAMind

9+阅读 · 2017年7月21日

Bregman divergence based em algorithm and its application to classical and quantum rate distortion theory

Arxiv

0+阅读 · 2022年1月7日

Analyticity and sparsity in uncertainty quantification for PDEs with Gaussian random field inputs

Arxiv

0+阅读 · 2022年1月6日

Functional-Input Gaussian Processes with Applications to Inverse Scattering Problems

Arxiv

0+阅读 · 2022年1月5日

Convergence and Complexity of Stochastic Block Majorization-Minimization

Arxiv

0+阅读 · 2022年1月5日

Approximate Spectral Decomposition of Fisher Information Matrix for Simple ReLU Networks

Arxiv

0+阅读 · 2022年1月5日

Conditional Monte Carlo for Reaction Networks

Arxiv

0+阅读 · 2022年1月4日

Optimal design of the Barker proposal and other locally-balanced Metropolis-Hastings algorithms

Arxiv

0+阅读 · 2022年1月4日

Uniform Convergence of Interpolators: Gaussian Width, Norm Bounds, and Benign Overfitting

Arxiv

0+阅读 · 2022年1月4日

Deep Convolutional Networks as shallow Gaussian Processes

Arxiv

4+阅读 · 2018年8月16日

A Dual Approach to Scalable Verification of Deep Networks

A Dual Approach to Scalable Verification of Deep Networks

Arxiv

3+阅读 · 2018年8月3日

VIP会员

文章信息

相关主题

Processing（编程语言）

Neural Networks

相关VIP内容

【硬核书】矩阵代数基础，248页pdf

【硬核书】矩阵代数基础，248页pdf

专知会员服务

87+阅读 · 2021年12月9日

【硬核书】树与网络上的概率，716页pdf

【硬核书】树与网络上的概率，716页pdf

专知会员服务

77+阅读 · 2021年12月8日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【经典书】应用随机微分方程，324页pdf，Applied Stochastic Differential Equations

【经典书】应用随机微分方程，324页pdf，Applied Stochastic Differential Equations

专知会员服务

57+阅读 · 2020年11月21日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

【清华大学】自动微分蒙特卡洛，理论与应用，Automatic Differentiable Monte Carlo: Theory and Application (附pdf）

专知会员服务

28+阅读 · 2019年11月23日

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

10+阅读 · 2019年10月24日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

热门VIP内容

开通专知VIP会员享更多权益服务

人工智能治理的未来

模态感知的特征匹配：单一模态与跨模态技术的全面综述

无监督行人重识别研究综述

【牛津博士论文】面向神经影像应用的可扩展且可解释的空间模型

相关资讯

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

《自然》（20190221出版）一周论文导读

《自然》（20190221出版）一周论文导读

科学网

6+阅读 · 2019年2月23日

已删除

将门创投

6+阅读 · 2018年12月3日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

可解释的CNN

可解释的CNN

CreateAMind

17+阅读 · 2017年10月5日

Adversarial Variational Bayes: Unifying VAE and GAN 代码

Adversarial Variational Bayes: Unifying VAE and GAN 代码

CreateAMind

7+阅读 · 2017年10月4日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

强化学习 cartpole_a3c

强化学习 cartpole_a3c

CreateAMind

9+阅读 · 2017年7月21日

相关论文

Bregman divergence based em algorithm and its application to classical and quantum rate distortion theory

Arxiv

0+阅读 · 2022年1月7日

Analyticity and sparsity in uncertainty quantification for PDEs with Gaussian random field inputs

Arxiv

0+阅读 · 2022年1月6日

Functional-Input Gaussian Processes with Applications to Inverse Scattering Problems

Arxiv

0+阅读 · 2022年1月5日

Convergence and Complexity of Stochastic Block Majorization-Minimization

Arxiv

0+阅读 · 2022年1月5日

Approximate Spectral Decomposition of Fisher Information Matrix for Simple ReLU Networks

Arxiv

0+阅读 · 2022年1月5日

Conditional Monte Carlo for Reaction Networks

Arxiv

0+阅读 · 2022年1月4日

Optimal design of the Barker proposal and other locally-balanced Metropolis-Hastings algorithms

Arxiv

0+阅读 · 2022年1月4日

Uniform Convergence of Interpolators: Gaussian Width, Norm Bounds, and Benign Overfitting

Arxiv

0+阅读 · 2022年1月4日

Deep Convolutional Networks as shallow Gaussian Processes

Arxiv

4+阅读 · 2018年8月16日

A Dual Approach to Scalable Verification of Deep Networks

A Dual Approach to Scalable Verification of Deep Networks

Arxiv

3+阅读 · 2018年8月3日

微信扫码咨询专知VIP会员