We study the problem of estimating an unknown function from noisy data using shallow (single-hidden-layer) ReLU neural networks. The estimators under consideration minimize the sum of squared data-fitting errors plus a regularization term proportional to the squared Euclidean norm of the network weights. This minimization corresponds to the common approach of training a neural network with weight decay. We quantify the performance (mean-squared error) of these neural network estimators when the data-generating function belongs to the space of functions of second-order bounded variation in the Radon domain. This space of functions was recently proposed as the natural function space associated with shallow ReLU neural networks. We derive a minimax lower bound for the estimation problem over this function space and show that the neural network estimators are minimax optimal up to logarithmic factors. We also show that this space is a "mixed variation" function space that contains classical multivariate function spaces, including certain Sobolev spaces and certain spectral Barron spaces. Finally, we use these results to quantify a gap between neural networks and linear methods (which include kernel methods). This paper sheds light on the phenomenon that neural networks seem to break the curse of dimensionality.
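As a concrete sketch of the objective the abstract describes (the notation below is our own, not taken from the paper): writing a width-$K$ shallow ReLU network as $f_\theta(x) = \sum_{k=1}^{K} v_k \max\{0,\, w_k^\top x - b_k\} + c$, training with weight decay corresponds to the estimator

$$\hat{\theta} \in \arg\min_{\theta} \; \sum_{i=1}^{n} \big(y_i - f_\theta(x_i)\big)^2 \;+\; \lambda \sum_{k=1}^{K} \big(|v_k|^2 + \|w_k\|_2^2\big),$$

where $\lambda > 0$ is the weight-decay parameter. Since rescaling a neuron via $v_k \mapsto \alpha v_k$, $(w_k, b_k) \mapsto (w_k, b_k)/\alpha$ leaves $f_\theta$ unchanged, minimizing over such rescalings replaces the penalty with $2\lambda \sum_{k} |v_k|\,\|w_k\|_2$; this rescaling-invariant form is what connects weight decay to the second-order bounded-variation seminorm in the Radon domain.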