以$L ⁇ 2}美元正规化的DNN:吸引力/振荡和公平性 (Feature Learning in $L_{2}$-regularized DNNs: Attraction/Repulsion and Sparsity) - 专知论文

会员服务 ·

0

特化 · 损失 · 情景 · 局部极小 · 极小点 ·

2022 年 10 月 13 日

Feature Learning in $L_{2}$-regularized DNNs: Attraction/Repulsion and Sparsity

翻译：以$L ⁇ 2}美元正规化的DNN:吸引力/振荡和公平性

Arthur Jacot,Eugene Golikov,Clément Hongler,Franck Gabriel

We study the loss surface of DNNs with $L_{2}$ regularization. We show that the loss in terms of the parameters can be reformulated into a loss in terms of the layerwise activations $Z_{\ell}$ of the training set. This reformulation reveals the dynamics behind feature learning: each hidden representations $Z_{\ell}$ are optimal w.r.t. to an attraction/repulsion problem and interpolate between the input and output representations, keeping as little information from the input as necessary to construct the activation of the next layer. For positively homogeneous non-linearities, the loss can be further reformulated in terms of the covariances of the hidden representations, which takes the form of a partially convex optimization over a convex cone. This second reformulation allows us to prove a sparsity result for homogeneous DNNs: any local minimum of the $L_{2}$-regularized loss can be achieved with at most $N(N+1)$ neurons in each hidden layer (where $N$ is the size of the training set). We show that this bound is tight by giving an example of a local minimum that requires $N^{2}/4$ hidden neurons. But we also observe numerically that in more traditional settings much less than $N^{2}$ neurons are required to reach the minima.

翻译：我们用$L+$2美元正规化来研究DNNs的损失表面。我们显示, 参数方面的损失可以重塑成一个损失, 也就是从DNNs的层次启动 $@ ell} 美元培训组的分层启动 $@ ell} 美元。这一重现揭示了特征学习背后的动态: 每一个隐藏的表示 $ ell} 美元都是最佳的 w.r. t 问题, 并且将输入和输出表示之间的中间插插点, 将输入和输出表示中所需的信息作为构建下层激活所需的信息少一些。对于正均匀的非线性, 可以从隐藏的表示的表达方式的变异性中进一步重现损失, 其形式是部分的 convex 优化。第二次重拟让我们证明, 每一个隐藏的DNNN: $2} 美元的当地最起码的最小值是每个隐藏层的神经元( $N+1) ( $是训练的大小 ) 。我们显示, 这个界限很紧凑, 通过给一个本地最低值的设置的例子, 也比隐藏的神经值要低。

0

相关内容

2020数据工程师成长路线图

专知会员服务

19+阅读 · 2020年9月6日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

中国图象图形学学会CSIG

0+阅读 · 2021年12月17日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

局部学习的特征选择：Local-Learning-Based Feature Selection

局部学习的特征选择：Local-Learning-Based Feature Selection

我爱读PAMI

14+阅读 · 2019年9月20日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

Schr？dinger-Poisson方程守恒DDG方法研究

国家自然科学基金

2+阅读 · 2015年12月31日

统计收敛的测度理论与超滤子收敛

国家自然科学基金

0+阅读 · 2014年12月31日

Zintl团簇化学-合成，结构与反应性

国家自然科学基金

0+阅读 · 2014年12月31日

二维MoS2纳米片电子结构的高压调控及机理研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于SURE/PURE准则的图像盲反卷积算法研究

国家自然科学基金

3+阅读 · 2013年12月31日

TiO2光催化机理的多时间尺度研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于上转换/金纳米粒子光开关构建可用于全血原位在体检测光纤生物探针的研究

国家自然科学基金

0+阅读 · 2012年12月31日

等离子体层顶位形统计研究

国家自然科学基金

0+阅读 · 2012年12月31日

在细胞粘附与迁移中协调多种小GTP酶的Arap3的结构与功能研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于视觉注意机制的多尺度图像融合的研究

国家自然科学基金

1+阅读 · 2009年12月31日

Informative Sample-Aware Proxy for Deep Metric Learning

Arxiv

0+阅读 · 2022年11月18日

Efficiency of Learning from Proof Blocks Versus Writing Proofs

Arxiv

0+阅读 · 2022年11月17日

Relative Behavioral Attributes: Filling the Gap between Symbolic Goal Specification and Reward Learning from Human Preferences

Arxiv

0+阅读 · 2022年11月16日

Analysis and Detectability of Offline Data Poisoning Attacks on Linear Systems

Arxiv

0+阅读 · 2022年11月16日

Addressing the issue of stochastic environments and local decision-making in multi-objective reinforcement learning

Arxiv

0+阅读 · 2022年11月16日

Mimicking the Oracle: An Initial Phase Decorrelation Approach for Class Incremental Learning

Arxiv

14+阅读 · 2022年3月25日

The Principles of Deep Learning Theory

Arxiv

65+阅读 · 2021年6月18日

On Feature Normalization and Data Augmentation

On Feature Normalization and Data Augmentation

Arxiv

15+阅读 · 2020年2月25日

A Survey of Reinforcement Learning Techniques: Strategies, Recent Development, and Future Directions

A Survey of Reinforcement Learning Techniques: Strategies, Recent Development, and Future Directions

Arxiv

79+阅读 · 2020年1月19日

Meta-Learning to Cluster

Meta-Learning to Cluster

Arxiv

17+阅读 · 2019年10月30日

VIP会员

文章信息

相关主题

相关VIP内容

2020数据工程师成长路线图

专知会员服务

19+阅读 · 2020年9月6日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《使用量化测量将传感器节点关联到融合中心的算法设计》171页

军事前沿模型

提升军事训练能力的最佳人工智能模拟工具

《社交媒体信息作战》最新48页技术报告

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

中国图象图形学学会CSIG

0+阅读 · 2021年12月17日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

局部学习的特征选择：Local-Learning-Based Feature Selection

局部学习的特征选择：Local-Learning-Based Feature Selection

我爱读PAMI

14+阅读 · 2019年9月20日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

相关论文

Informative Sample-Aware Proxy for Deep Metric Learning

Arxiv

0+阅读 · 2022年11月18日

Efficiency of Learning from Proof Blocks Versus Writing Proofs

Arxiv

0+阅读 · 2022年11月17日

Relative Behavioral Attributes: Filling the Gap between Symbolic Goal Specification and Reward Learning from Human Preferences

Arxiv

0+阅读 · 2022年11月16日

Analysis and Detectability of Offline Data Poisoning Attacks on Linear Systems

Arxiv

0+阅读 · 2022年11月16日

Addressing the issue of stochastic environments and local decision-making in multi-objective reinforcement learning

Arxiv

0+阅读 · 2022年11月16日

Mimicking the Oracle: An Initial Phase Decorrelation Approach for Class Incremental Learning

Arxiv

14+阅读 · 2022年3月25日

The Principles of Deep Learning Theory

Arxiv

65+阅读 · 2021年6月18日

On Feature Normalization and Data Augmentation

On Feature Normalization and Data Augmentation

Arxiv

15+阅读 · 2020年2月25日

A Survey of Reinforcement Learning Techniques: Strategies, Recent Development, and Future Directions

A Survey of Reinforcement Learning Techniques: Strategies, Recent Development, and Future Directions

Arxiv

79+阅读 · 2020年1月19日

Meta-Learning to Cluster

Meta-Learning to Cluster

Arxiv

17+阅读 · 2019年10月30日

相关基金

Schr？dinger-Poisson方程守恒DDG方法研究

国家自然科学基金

2+阅读 · 2015年12月31日

统计收敛的测度理论与超滤子收敛

国家自然科学基金

0+阅读 · 2014年12月31日

Zintl团簇化学-合成，结构与反应性

国家自然科学基金

0+阅读 · 2014年12月31日

二维MoS2纳米片电子结构的高压调控及机理研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于SURE/PURE准则的图像盲反卷积算法研究

国家自然科学基金

3+阅读 · 2013年12月31日

TiO2光催化机理的多时间尺度研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于上转换/金纳米粒子光开关构建可用于全血原位在体检测光纤生物探针的研究

国家自然科学基金

0+阅读 · 2012年12月31日

等离子体层顶位形统计研究

国家自然科学基金

0+阅读 · 2012年12月31日

在细胞粘附与迁移中协调多种小GTP酶的Arap3的结构与功能研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于视觉注意机制的多尺度图像融合的研究

国家自然科学基金

1+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员