The modern strategy for training deep neural networks for classification tasks includes optimizing the network's weights even after the training error vanishes, to further push the training loss toward zero. Recently, a phenomenon termed "neural collapse" (NC) has been empirically observed in this training procedure. Specifically, it has been shown that the learned features (the output of the penultimate layer) of within-class samples converge to their mean, and the means of different classes exhibit a certain tight frame structure, which is also aligned with the last layer's weights. Recent papers have shown that minimizers with this structure emerge when optimizing a simplified "unconstrained features model" (UFM) with a regularized cross-entropy loss. In this paper, we further analyze and extend the UFM. First, we study the UFM for the regularized MSE loss, and show that the minimizers' features can have a more delicate structure than in the cross-entropy case. This also affects the structure of the weights. Then, we extend the UFM by adding another layer of weights as well as ReLU nonlinearity to the model, and generalize our previous results. Finally, we empirically demonstrate the usefulness of our nonlinear extended UFM in modeling the NC phenomenon that occurs with practical networks.
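To make the setting concrete, the following is a minimal sketch of the unconstrained features model with a regularized MSE loss, written in NumPy. In the UFM, the penultimate-layer features are treated as free optimization variables alongside the last layer's weights. All numerical choices here (dimensions, regularization strengths, learning rate, omission of a bias term) are illustrative assumptions, not the paper's exact configuration; the sketch only demonstrates that gradient descent on this objective drives within-class features toward their class means, i.e., toward neural collapse.

```python
import numpy as np

rng = np.random.default_rng(0)

K, n, d = 3, 5, 8            # classes, samples per class, feature dim (assumed)
N = K * n
lam_W, lam_H = 5e-3, 5e-3    # regularization strengths (assumed values)

# One-hot targets Y (K x N); columns i*n..(i+1)*n-1 belong to class i
Y = np.kron(np.eye(K), np.ones((1, n)))

# "Unconstrained" features H (d x N) and last-layer weights W (K x d),
# both treated as free variables and optimized jointly
H = rng.standard_normal((d, N))
W = rng.standard_normal((K, d))

lr = 0.1
for _ in range(30000):
    E = W @ H - Y                    # residual of the MSE fit
    gW = E @ H.T / N + lam_W * W     # gradient w.r.t. weights
    gH = W.T @ E / N + lam_H * H     # gradient w.r.t. free features
    W -= lr * gW
    H -= lr * gH

# Neural-collapse check: within-class feature variation shrinks
# relative to the spread of the class means
means = H.reshape(d, K, n).mean(axis=2)          # class means (d x K)
within = H.reshape(d, K, n) - means[:, :, None]
ratio = np.linalg.norm(within) / np.linalg.norm(means)
print(f"within/between norm ratio: {ratio:.4f}")
```

Note that for fixed `W`, the optimal `H` has identical columns within each class (since the one-hot targets are identical within a class), which is why the within/between ratio collapses toward zero as optimization proceeds.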