In this paper we are concerned with the approximation of functions by single-hidden-layer neural networks with ReLU activation on the unit circle. In particular, we are interested in the case when the number of data points exceeds the number of nodes. We first study the convergence to equilibrium of the stochastic gradient flow associated with the cost function with a quadratic penalization. Specifically, we prove a Poincar\'e inequality for a penalized version of the cost function, with explicit constants that are independent of the data and of the number of nodes. Since our penalization biases the weights to be bounded, this leads us to study how well a network with bounded weights can approximate a given function of bounded variation (BV). Our main contribution concerning the approximation of BV functions is a result which we call the localization theorem. Specifically, it states that the expected error of the constrained problem, in which the lengths of the weights are at most $R$, is of order $R^{-1/9}$ relative to the unconstrained problem (the global optimum). The proof technique is new in this setting and is inspired by methods from the regularity theory of elliptic partial differential equations. Finally, we quantify the expected value of the global optimum by proving a quantitative version of the universal approximation theorem.
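For concreteness, the following display sketches the objects described above under assumed notation; the width $n$, the data $(x_j, y_j)_{j=1}^m$ with $m > n$, the penalization strength $\lambda$, the temperature $\tau$, and the error functional $\mathrm{err}_R$ are placeholders not fixed by the abstract, and the precise norms and normalizations are those of the paper body:
\begin{align*}
  &\text{shallow ReLU network on } \mathbb{S}^1: &
    f_\theta(x) &= \sum_{i=1}^{n} c_i\, \sigma\bigl(\langle w_i, x\rangle + b_i\bigr),
    \qquad \sigma(t) = \max(t, 0),\\
  &\text{penalized cost:} &
    F_\lambda(\theta) &= \frac{1}{m} \sum_{j=1}^{m} \bigl(f_\theta(x_j) - y_j\bigr)^2
    + \lambda\, |\theta|^2,\\
  &\text{Poincar\'e inequality for } \mu_\tau \propto e^{-F_\lambda/\tau}: &
    \operatorname{Var}_{\mu_\tau}(\varphi) &\le C \int |\nabla \varphi|^2 \, d\mu_\tau,\\
  &\text{localization bound:} &
    \mathbb{E}\bigl[\mathrm{err}_R\bigr] - \mathbb{E}\bigl[\mathrm{err}_\infty\bigr]
    &\lesssim R^{-1/9},
    \qquad \mathrm{err}_R = \inf_{|w_i| \le R} \|f - f_\theta\|,
\end{align*}
where the constant $C$ is independent of the data and of $n$.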