深网中稳定阶层的影响 (On the Impact of Stable Ranks in Deep Nets) - 专知论文

会员服务 ·

0

秩 · DNN · MoDELS · 分解的 · Neural Networks ·

2021 年 10 月 5 日

On the Impact of Stable Ranks in Deep Nets

翻译：深网中稳定阶层的影响

Bogdan Georgiev,Lukas Franken,Mayukh Mukherjee,Georgios Arvanitidis

from arxiv, 24 pages, 8 figures, comments welcome!

A recent line of work has established intriguing connections between the generalization/compression properties of a deep neural network (DNN) model and the so-called layer weights' stable ranks. Intuitively, the latter are indicators of the effective number of parameters in the net. In this work, we address some natural questions regarding the space of DNNs conditioned on the layers' stable rank, where we study feed-forward dynamics, initialization, training and expressivity. To this end, we first propose a random DNN model with a new sampling scheme based on stable rank. Then, we show how feed-forward maps are affected by the constraint and how training evolves in the overparametrized regime (via Neural Tangent Kernels). Our results imply that stable ranks appear layerwise essentially as linear factors whose effect accumulates exponentially depthwise. Moreover, we provide empirical analysis suggesting that stable rank initialization alone can lead to convergence speed ups.

翻译：最近的一项工作在深神经网络(DNN)模型的一般化/压缩特性与所谓的层权重稳定等级之间建立起了令人感兴趣的联系,从直觉上看,后者是网络参数有效数量的指标。在这项工作中,我们处理一些关于以层稳定等级为条件的DNN空间的自然问题,我们在那里研究进料向导动态、初始化、培训和表达性。为此,我们首先提出一个随机的DNN模型,并采用以稳定等级为基础的新取样办法。然后,我们展示进料向前的地图如何受到制约的影响,以及培训在过度平衡制度(通过Neural Tangent Kernels)中是如何演变的。我们的结果表明,稳定的等级基本上看起来是线性因素,其效应会以指数深度指数累积。此外,我们提供经验分析表明,单稳定级初始化本身就能够导致趋同速度的。

0

相关内容

鲁棒表示学习简述

专知会员服务

26+阅读 · 2021年4月13日

剑桥大学《数据科学: 原理与实践》课程，附PPT下载

剑桥大学《数据科学: 原理与实践》课程，附PPT下载

专知会员服务

53+阅读 · 2021年1月20日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

【Google】平滑对抗训练，Smooth Adversarial Training

【Google】平滑对抗训练，Smooth Adversarial Training

专知会员服务

49+阅读 · 2020年7月4日

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

10+阅读 · 2019年10月24日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

已删除

将门创投

4+阅读 · 2017年11月1日

ReduNet: A White-box Deep Network from the Principle of Maximizing Rate Reduction

Arxiv

0+阅读 · 2021年11月29日

On the Global Convergence of Gradient Descent for multi-layer ResNets in the mean-field regime

Arxiv

0+阅读 · 2021年11月28日

How Does a Neural Network's Architecture Impact Its Robustness to Noisy Labels?

Arxiv

0+阅读 · 2021年11月28日

Towards Understanding the Impact of Model Size on Differential Private Classification

Arxiv

0+阅读 · 2021年11月27日

Joint inference and input optimization in equilibrium networks

Arxiv

0+阅读 · 2021年11月25日

Efficiency of the financial markets during the COVID-19 crisis: time-varying parameters of fractional stable dynamics

Arxiv

0+阅读 · 2021年11月25日

A Farewell to the Bias-Variance Tradeoff? An Overview of the Theory of Overparameterized Machine Learning

Arxiv

15+阅读 · 2021年9月6日

The Principles of Deep Learning Theory

Arxiv

65+阅读 · 2021年6月18日

Graph Neural Tangent Kernel: Fusing Graph Neural Networks with Graph Kernels

Arxiv

8+阅读 · 2019年11月4日

Stable Distribution Alignment Using the Dual of the Adversarial Distance

Arxiv

3+阅读 · 2018年1月30日

VIP会员

文章信息

相关主题

Neural Networks

相关VIP内容

鲁棒表示学习简述

专知会员服务

26+阅读 · 2021年4月13日

剑桥大学《数据科学: 原理与实践》课程，附PPT下载

剑桥大学《数据科学: 原理与实践》课程，附PPT下载

专知会员服务

53+阅读 · 2021年1月20日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

【Google】平滑对抗训练，Smooth Adversarial Training

【Google】平滑对抗训练，Smooth Adversarial Training

专知会员服务

49+阅读 · 2020年7月4日

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

10+阅读 · 2019年10月24日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

【伯克利博士论文】通过真实世界实践赋能机器人自主性

军用无人机集群技术尚未成熟——但潜力可期

人工智能安全治理白皮书（2025）

AgentOps综述：分类、挑战与未来方向

相关资讯

已删除

将门创投

4+阅读 · 2017年11月1日

相关论文

ReduNet: A White-box Deep Network from the Principle of Maximizing Rate Reduction

Arxiv

0+阅读 · 2021年11月29日

On the Global Convergence of Gradient Descent for multi-layer ResNets in the mean-field regime

Arxiv

0+阅读 · 2021年11月28日

How Does a Neural Network's Architecture Impact Its Robustness to Noisy Labels?

Arxiv

0+阅读 · 2021年11月28日

Towards Understanding the Impact of Model Size on Differential Private Classification

Arxiv

0+阅读 · 2021年11月27日

Joint inference and input optimization in equilibrium networks

Arxiv

0+阅读 · 2021年11月25日

Efficiency of the financial markets during the COVID-19 crisis: time-varying parameters of fractional stable dynamics

Arxiv

0+阅读 · 2021年11月25日

A Farewell to the Bias-Variance Tradeoff? An Overview of the Theory of Overparameterized Machine Learning

Arxiv

15+阅读 · 2021年9月6日

The Principles of Deep Learning Theory

Arxiv

65+阅读 · 2021年6月18日

Graph Neural Tangent Kernel: Fusing Graph Neural Networks with Graph Kernels

Arxiv

8+阅读 · 2019年11月4日

Stable Distribution Alignment Using the Dual of the Adversarial Distance

Arxiv

3+阅读 · 2018年1月30日

微信扫码咨询专知VIP会员