神经网络中的深度分离:什么是实际分离? (Depth Separations in Neural Networks: What is Actually Being Separated?) - 专知论文

会员服务 ·

0

Networks · 泛函 · 分离的 · 模型评估 · 近似 ·

2021 年 6 月 2 日

Depth Separations in Neural Networks: What is Actually Being Separated?

翻译：神经网络中的深度分离:什么是实际分离?

Itay Safran,Ronen Eldan,Ohad Shamir

Existing depth separation results for constant-depth networks essentially show that certain radial functions in $\mathbb{R}^d$, which can be easily approximated with depth $3$ networks, cannot be approximated by depth $2$ networks, even up to constant accuracy, unless their size is exponential in $d$. However, the functions used to demonstrate this are rapidly oscillating, with a Lipschitz parameter scaling polynomially with the dimension $d$ (or equivalently, by scaling the function, the hardness result applies to $\mathcal{O}(1)$-Lipschitz functions only when the target accuracy $\epsilon$ is at most $\text{poly}(1/d)$). In this paper, we study whether such depth separations might still hold in the natural setting of $\mathcal{O}(1)$-Lipschitz radial functions, when $\epsilon$ does not scale with $d$. Perhaps surprisingly, we show that the answer is negative: In contrast to the intuition suggested by previous work, it \emph{is} possible to approximate $\mathcal{O}(1)$-Lipschitz radial functions with depth $2$, size $\text{poly}(d)$ networks, for every constant $\epsilon$. We complement it by showing that approximating such functions is also possible with depth $2$, size $\text{poly}(1/\epsilon)$ networks, for every constant $d$. Finally, we show that it is not possible to have polynomial dependence in both $d,1/\epsilon$ simultaneously. Overall, our results indicate that in order to show depth separations for expressing $\mathcal{O}(1)$-Lipschitz functions with constant accuracy -- if at all possible -- one would need fundamentally different techniques than existing ones in the literature.

翻译：恒定深度网络的现有深度分离结果基本上显示,某些以${mathbb{R ⁇ d$(如果目标精度为$\mathb{O}(1)$-Lipschitz$(如果目标精度为$\eepsilon$(美元)),那么,即使深度网络的大小以美元为指数,也不可能以恒定的准确度为基数,即使其规模以美元计算。然而,用来显示这种深度分离的功能是快速振动的,如果利普西茨的参数以美元为基数(或者通过调整功能,硬度结果适用于美元=mathalcal{O}(如果目标精度精度为$),美元=oqualth$(美元),那么我们现有的答案是否定的:与先前工作建议的直观值相比, 美元=emsil=xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx) 函数显示可能的常值。

0

相关内容

Networks

Explanation：网络。 Publisher：Wiley。 SIT： http://dblp.uni-trier.de/db/journals/networks/

【NeurIPS 2020】图神经网络的参数化解释器，Parameterized Explainer for GNN

【NeurIPS 2020】图神经网络的参数化解释器，Parameterized Explainer for GNN

专知会员服务

22+阅读 · 2020年11月13日

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

专知会员服务

111+阅读 · 2020年5月15日

来自Fariz Darari博士的一份简明《神经网络与深度学习》的讲义，64页ppt

来自Fariz Darari博士的一份简明《神经网络与深度学习》的讲义，64页ppt

专知会员服务

92+阅读 · 2020年5月5日

神经网络的拓扑结构，TOPOLOGY OF DEEP NEURAL NETWORKS

神经网络的拓扑结构，TOPOLOGY OF DEEP NEURAL NETWORKS

专知会员服务

35+阅读 · 2020年4月15日

【论文推荐】二值神经网络综述，Binary Neural Networks: A Survey

【论文推荐】二值神经网络综述，Binary Neural Networks: A Survey

专知会员服务

53+阅读 · 2020年4月8日

【ICLR2020-】基于记忆的图网络，MEMORY-BASED GRAPH NETWORKS

【ICLR2020-】基于记忆的图网络，MEMORY-BASED GRAPH NETWORKS

专知会员服务

110+阅读 · 2020年2月22日

【论文推荐WWW2020-】休息:用于野外睡眠监测的强大而有效的神经网络 REST: Robust and Efficient Neural Networks for Sleep Monitoring in the Wild

【论文推荐WWW2020-】休息:用于野外睡眠监测的强大而有效的神经网络 REST: Robust and Efficient Neural Networks for Sleep Monitoring in the Wild

专知会员服务

7+阅读 · 2020年2月1日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

最新BERT相关论文清单，BERT-related Papers

最新BERT相关论文清单，BERT-related Papers

专知会员服务

53+阅读 · 2019年9月29日

已删除

将门创投

11+阅读 · 2019年4月26日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

Approximating Sumset Size

Arxiv

0+阅读 · 2021年7月26日

RAC-drawability is ER-complete

Arxiv

0+阅读 · 2021年7月24日

Optimal Approximation Rate of ReLU Networks in terms of Width and Depth

Arxiv

0+阅读 · 2021年7月24日

Numerical approximation of the scattering amplitude in elasticity

Arxiv

0+阅读 · 2021年7月24日

Deep ReLU neural networks in high-dimensional approximation

Arxiv

0+阅读 · 2021年7月23日

Error Estimates for Neural Network Solutions of Partial Differential Equations

Arxiv

0+阅读 · 2021年7月23日

Function approximation by deep neural networks with parameters $\{0,\pm \frac{1}{2}, \pm 1, 2\}$

Arxiv

0+阅读 · 2021年7月23日

Cosine and Computation

Arxiv

0+阅读 · 2021年7月20日

Minimal Variance Sampling with Provable Guarantees for Fast Training of Graph Neural Networks

Minimal Variance Sampling with Provable Guarantees for Fast Training of Graph Neural Networks

Arxiv

13+阅读 · 2020年6月24日

Optimal Algorithms for Non-Smooth Distributed Optimization in Networks

Arxiv

7+阅读 · 2018年6月1日

VIP会员

文章信息

相关主题

相关VIP内容

【NeurIPS 2020】图神经网络的参数化解释器，Parameterized Explainer for GNN

【NeurIPS 2020】图神经网络的参数化解释器，Parameterized Explainer for GNN

专知会员服务

22+阅读 · 2020年11月13日

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

专知会员服务

111+阅读 · 2020年5月15日

来自Fariz Darari博士的一份简明《神经网络与深度学习》的讲义，64页ppt

来自Fariz Darari博士的一份简明《神经网络与深度学习》的讲义，64页ppt

专知会员服务

92+阅读 · 2020年5月5日

神经网络的拓扑结构，TOPOLOGY OF DEEP NEURAL NETWORKS

神经网络的拓扑结构，TOPOLOGY OF DEEP NEURAL NETWORKS

专知会员服务

35+阅读 · 2020年4月15日

【论文推荐】二值神经网络综述，Binary Neural Networks: A Survey

【论文推荐】二值神经网络综述，Binary Neural Networks: A Survey

专知会员服务

53+阅读 · 2020年4月8日

【ICLR2020-】基于记忆的图网络，MEMORY-BASED GRAPH NETWORKS

【ICLR2020-】基于记忆的图网络，MEMORY-BASED GRAPH NETWORKS

专知会员服务

110+阅读 · 2020年2月22日

【论文推荐WWW2020-】休息:用于野外睡眠监测的强大而有效的神经网络 REST: Robust and Efficient Neural Networks for Sleep Monitoring in the Wild

【论文推荐WWW2020-】休息:用于野外睡眠监测的强大而有效的神经网络 REST: Robust and Efficient Neural Networks for Sleep Monitoring in the Wild

专知会员服务

7+阅读 · 2020年2月1日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

最新BERT相关论文清单，BERT-related Papers

最新BERT相关论文清单，BERT-related Papers

专知会员服务

53+阅读 · 2019年9月29日

热门VIP内容

开通专知VIP会员享更多权益服务

《美陆军特种作战条令》最新102页

《洛克希德SR-71“黑鸟”侦察机动力系统》21页slides

美空军作战实验室通过人工智能和指挥控制技术创新推进杀伤链

《指挥控制能力分析方法论》最新报告

相关资讯

已删除

将门创投

11+阅读 · 2019年4月26日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

相关论文

Approximating Sumset Size

Arxiv

0+阅读 · 2021年7月26日

RAC-drawability is ER-complete

Arxiv

0+阅读 · 2021年7月24日

Optimal Approximation Rate of ReLU Networks in terms of Width and Depth

Arxiv

0+阅读 · 2021年7月24日

Numerical approximation of the scattering amplitude in elasticity

Arxiv

0+阅读 · 2021年7月24日

Deep ReLU neural networks in high-dimensional approximation

Arxiv

0+阅读 · 2021年7月23日

Error Estimates for Neural Network Solutions of Partial Differential Equations

Arxiv

0+阅读 · 2021年7月23日

Function approximation by deep neural networks with parameters $\{0,\pm \frac{1}{2}, \pm 1, 2\}$

Arxiv

0+阅读 · 2021年7月23日

Cosine and Computation

Arxiv

0+阅读 · 2021年7月20日

Minimal Variance Sampling with Provable Guarantees for Fast Training of Graph Neural Networks

Minimal Variance Sampling with Provable Guarantees for Fast Training of Graph Neural Networks

Arxiv

13+阅读 · 2020年6月24日

Optimal Algorithms for Non-Smooth Distributed Optimization in Networks

Arxiv

7+阅读 · 2018年6月1日

微信扫码咨询专知VIP会员