Learning Lipschitz Functions by GD-trained Shallow Overparameterized ReLU Neural Networks - Zhuanzhi (专知) Papers


April 6, 2023


Ilja Kuzborskij, Csaba Szepesvári

We explore the ability of overparameterized shallow ReLU neural networks to learn Lipschitz, nondifferentiable, bounded functions with additive noise when trained by Gradient Descent (GD). To avoid the problem that in the presence of noise, neural networks trained to nearly zero training error are inconsistent in this class, we focus on the early-stopped GD which allows us to show consistency and optimal rates. In particular, we explore this problem from the viewpoint of the Neural Tangent Kernel (NTK) approximation of a GD-trained finite-width neural network. We show that whenever some early stopping rule is guaranteed to give an optimal rate (of excess risk) on the Hilbert space of the kernel induced by the ReLU activation function, the same rule can be used to achieve minimax optimal rate for learning on the class of considered Lipschitz functions by neural networks. We discuss several data-free and data-dependent practically appealing stopping rules that yield optimal rates.

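To make the setting in the abstract concrete, here is a minimal NumPy sketch of early-stopped full-batch GD on a shallow ReLU network. It is an illustration under stated assumptions, not the authors' procedure: the target `|x|`, the noise level, the width, the step size, and the hold-out stopping rule (keep the iterate with the lowest validation error) are all choices made for this example, and every function name is hypothetical.

```python
import numpy as np


def make_data(n, rng, noise_std=0.1):
    """Noisy samples of f(x) = |x|, a Lipschitz, nondifferentiable, bounded target."""
    x = rng.uniform(-1.0, 1.0, size=n)
    X = np.stack([x, np.ones(n)], axis=1)
    X /= np.linalg.norm(X, axis=1, keepdims=True)  # inputs (x, 1) rescaled to unit norm
    y = np.abs(x) + noise_std * rng.normal(size=n)
    return X, y


def init_network(width, dim, rng):
    """Gaussian hidden weights and fixed random output signs (assumed setup)."""
    W = rng.normal(size=(width, dim))
    a = rng.choice([-1.0, 1.0], size=width)
    return W, a


def predict(W, a, X):
    """f(x) = (1 / sqrt(m)) * sum_k a_k * relu(<w_k, x>)."""
    return np.maximum(X @ W.T, 0.0) @ a / np.sqrt(W.shape[0])


def mse(W, a, X, y):
    return float(np.mean((predict(W, a, X) - y) ** 2))


def gd_step(W, a, X, y, lr):
    """One full-batch GD step on the hidden weights for the loss 0.5 * MSE."""
    m, n = W.shape[0], len(y)
    pre = X @ W.T                                    # pre-activations, shape (n, m)
    resid = np.maximum(pre, 0.0) @ a / np.sqrt(m) - y
    # d f(x_i) / d w_k = a_k * 1[pre_ik > 0] * x_i / sqrt(m)
    grad = ((resid[:, None] * (pre > 0)) * a).T @ X / (np.sqrt(m) * n)
    return W - lr * grad


def train_early_stopped(X_tr, y_tr, X_val, y_val,
                        width=512, lr=0.5, max_steps=500, seed=0):
    """Run GD and return the iterate with the smallest hold-out error."""
    rng = np.random.default_rng(seed)
    W, a = init_network(width, X_tr.shape[1], rng)
    best_W, best_val, best_step = W.copy(), mse(W, a, X_val, y_val), 0
    for t in range(1, max_steps + 1):
        W = gd_step(W, a, X_tr, y_tr, lr)
        val = mse(W, a, X_val, y_val)
        if val < best_val:                           # hold-out stopping rule
            best_W, best_val, best_step = W.copy(), val, t
    return best_W, a, best_step, best_val


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    X_tr, y_tr = make_data(200, rng)
    X_val, y_val = make_data(100, rng)
    W, a, step, val = train_early_stopped(X_tr, y_tr, X_val, y_val)
    print(f"stopped at step {step}, validation MSE {val:.4f}")
```

Only the hidden layer is trained and the output signs stay fixed, a common simplification in NTK-style analyses that is assumed here rather than taken from the abstract. A data-free rule would instead fix the number of steps in advance as a function of the sample size.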
