With ever-growing model sizes and the limited availability of labeled training data, transfer learning has become an increasingly popular approach in many science and engineering domains. For classification problems, this work delves into the mystery of transfer learning through an intriguing phenomenon termed neural collapse (NC), where the last-layer features and classifiers of learned deep networks satisfy: (i) the within-class variability of the features collapses to zero, and (ii) the between-class feature means are maximally and equally separated. Through the lens of NC, our findings for transfer learning are the following: (i) when pre-training models, preventing within-class variability collapse (to a certain extent) better preserves the intrinsic structure of the input data and thus leads to better model transferability; (ii) when fine-tuning models on downstream tasks, obtaining features exhibiting more NC on the downstream data results in better test accuracy on the given task. The above results not only demystify many widely used heuristics in model pre-training (e.g., data augmentation, projection heads, self-supervised learning), but also lead to a more efficient and principled fine-tuning method for downstream tasks, which we demonstrate through extensive experimental results.
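The within-class variability collapse described in condition (i) is commonly quantified by comparing the within-class covariance $\Sigma_W$ against the between-class covariance $\Sigma_B$, e.g., via $\mathrm{tr}(\Sigma_W \Sigma_B^{\dagger})/K$ for $K$ classes. As a minimal, illustrative NumPy sketch (the function name and setup are our own, not the paper's implementation):

```python
import numpy as np

def within_class_variability(features, labels):
    """NC1-style metric: trace(Sigma_W @ pinv(Sigma_B)) / K.
    Approaches zero when each sample collapses to its class mean."""
    classes = np.unique(labels)
    K = len(classes)
    n, d = features.shape
    global_mean = features.mean(axis=0)
    Sigma_W = np.zeros((d, d))  # within-class covariance
    Sigma_B = np.zeros((d, d))  # between-class covariance
    for c in classes:
        X = features[labels == c]
        mu = X.mean(axis=0)
        centered = X - mu
        Sigma_W += centered.T @ centered / n
        diff = (mu - global_mean)[:, None]
        Sigma_B += (diff @ diff.T) / K
    # Pseudo-inverse since Sigma_B is rank-deficient (rank <= K - 1)
    return np.trace(Sigma_W @ np.linalg.pinv(Sigma_B)) / K

# Toy check: fully collapsed features give ~0; adding noise raises the metric.
rng = np.random.default_rng(0)
class_means = rng.normal(size=(3, 5))
labels = np.repeat(np.arange(3), 10)
collapsed = class_means[labels]                 # every sample == its class mean
noisy = collapsed + 0.5 * rng.normal(size=collapsed.shape)
print(within_class_variability(collapsed, labels))  # ~ 0 (full collapse)
print(within_class_variability(noisy, labels))      # strictly larger
```

Under this metric, the paper's two findings correspond to keeping the value moderately large during pre-training and driving it toward zero during fine-tuning on the downstream data.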