利用基于黑森普遍化保障对深神经网络进行强有力的精细调整 (Robust Fine-Tuning of Deep Neural Networks with Hessian-based Generalization Guarantees) - 专知论文

会员服务 ·

0

泛化理论 · motivation · Neural Networks · 稳健性 · Networking ·

2022 年 8 月 29 日

Robust Fine-Tuning of Deep Neural Networks with Hessian-based Generalization Guarantees

翻译：利用基于黑森普遍化保障对深神经网络进行强有力的精细调整

Haotian Ju,Dongyue Li,Hongyang R. Zhang

from arxiv, 36 pages, 5 figures, 7 tables; minor revision with a few additional references and corrected typos

We consider transfer learning approaches that fine-tune a pretrained deep neural network on a target task. We study generalization properties of fine-tuning to understand the problem of overfitting, which commonly occurs in practice. Previous works have shown that constraining the distance from the initialization of fine-tuning improves generalization. Using a PAC-Bayesian analysis, we observe that besides distance from initialization, Hessians affect generalization through the noise stability of deep neural networks against noise injections. Motivated by the observation, we develop Hessian distance-based generalization bounds for a wide range of fine-tuning methods. Additionally, we study the robustness of fine-tuning in the presence of noisy labels. Motivated by our theory, we design an algorithm that incorporates consistent losses and distance-based regularization for fine-tuning, along with a generalization error guarantee under class conditional independent noise in the training set labels. We perform a detailed empirical study of our algorithm on various noisy environments and architectures. On six image classification tasks whose training labels are generated with programmatic labeling, we find a 3.26% accuracy gain over prior fine-tuning methods. Meanwhile, the Hessian distance measure of the fine-tuned model decreases by six times more than existing approaches.

翻译：我们考虑在目标任务上微调精练深神经网络的传学方法。我们研究微调的普及性,以了解通常在实践中经常发生的超装问题。我们以前的工作表明,限制微调初始化的距离可以改进一般化。我们使用PAC-Bayesian分析发现,除了从初始化的距离外,赫森人通过深神经网络的噪音稳定性来影响一般化,反对噪音注入。我们受观察的驱动,我们开发了基于远距的海珊光谱化范围,以了解广泛的微调方法。此外,我们研究在噪音标签存在的情况下微调的稳健性。根据我们的理论,我们设计了一种算法,将持续损失和基于远程的微调正规化纳入其中,同时在培训标签中附加一个条件性独立噪音的班级一般化错误保证。我们对各种噪音环境和结构的算法进行了详细的实证研究。在六种图像分类任务中,其培训标签是用方案标签生成的,我们发现一个3.26的精确度模型,比现有的微调方法高出了六度。同时,他还发现比现有的微调方法改进了3.26的精确度。

0

相关内容

泛化理论

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

【ETH】最新《几何数据分析》2020课程，附PPT下载

专知会员服务

44+阅读 · 2020年12月18日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

中国图象图形学学会CSIG

0+阅读 · 2021年12月17日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

局部学习的特征选择：Local-Learning-Based Feature Selection

局部学习的特征选择：Local-Learning-Based Feature Selection

我爱读PAMI

14+阅读 · 2019年9月20日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

15+阅读 · 2019年4月13日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

Capsule Networks解析

Capsule Networks解析

机器学习研究会

11+阅读 · 2017年11月12日

Med25作为共激活因子对糖皮质激素受体GRα介导的CYP2C9的调控机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

拟南芥类受体蛋白激酶CRKN1在脱落酸信号转导中的功能研究

国家自然科学基金

0+阅读 · 2015年12月31日

IMP3调控上皮间质转化和肿瘤干细胞进而参与结肠癌发生和转移的机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

PPAR β/δ基因在结直肠癌血管生成调控中的作用及分子机理

国家自然科学基金

2+阅读 · 2014年12月31日

基于Exemplar-Classifier思想的高分辨率光学遥感影像目标识别研究

国家自然科学基金

2+阅读 · 2013年12月31日

基于上皮间充质转化和细胞外基质沉积研究人参皂甙Rg1对COPD发生发展的作用及机制

国家自然科学基金

0+阅读 · 2013年12月31日

Lee偏差在试验设计中的应用研究

国家自然科学基金

0+阅读 · 2013年12月31日

约束优化方法及其在图像恢复中的应用

国家自然科学基金

0+阅读 · 2012年12月31日

生物特征识别中高维数据的统计降维及算法研究

国家自然科学基金

0+阅读 · 2012年12月31日

GSK-3β调控血管平滑肌细胞特异性转录因子Myocardin对动脉粥样硬化斑块形成作用及分子机制

国家自然科学基金

0+阅读 · 2012年12月31日

Statistical, Robustness, and Computational Guarantees for Sliced Wasserstein Distances

Arxiv

0+阅读 · 2022年10月17日

Neural Lyapunov Control of Unknown Nonlinear Systems with Stability Guarantees

Arxiv

0+阅读 · 2022年10月16日

Active Learning with Neural Networks: Insights from Nonparametric Statistics

Arxiv

0+阅读 · 2022年10月15日

Why Robust Generalization in Deep Learning is Difficult: Perspective of Expressive Power

Arxiv

0+阅读 · 2022年10月14日

Effective Class-Imbalance learning based on SMOTE and Convolutional Neural Networks

Arxiv

0+阅读 · 2022年10月13日

Generalization Bounds with Minimal Dependency on Hypothesis Class via Distributionally Robust Optimization

Arxiv

0+阅读 · 2022年10月12日

Scaling Properties of Deep Residual Networks

Arxiv

13+阅读 · 2021年5月25日

Faster Meta Update Strategy for Noise-Robust Deep Learning

Arxiv

11+阅读 · 2021年4月30日

Minimal Variance Sampling with Provable Guarantees for Fast Training of Graph Neural Networks

Minimal Variance Sampling with Provable Guarantees for Fast Training of Graph Neural Networks

Arxiv

13+阅读 · 2020年6月24日

Class-Balanced Loss Based on Effective Number of Samples

Arxiv

12+阅读 · 2019年1月16日

VIP会员

文章信息

相关主题

Neural Networks

相关VIP内容

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

【ETH】最新《几何数据分析》2020课程，附PPT下载

专知会员服务

44+阅读 · 2020年12月18日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《物联网（IoT）中的无人机通信高效控制》135页

《在GNSS信号降级环境中利用共识实现无人机集群稳健协调》

中程单向攻击无人机的战略意义：俄乌战争启示

《面向无人机集群的避障动态传感器覆盖算法》最新38页

相关资讯

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

中国图象图形学学会CSIG

0+阅读 · 2021年12月17日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

局部学习的特征选择：Local-Learning-Based Feature Selection

局部学习的特征选择：Local-Learning-Based Feature Selection

我爱读PAMI

14+阅读 · 2019年9月20日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

15+阅读 · 2019年4月13日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

Capsule Networks解析

Capsule Networks解析

机器学习研究会

11+阅读 · 2017年11月12日

相关论文

Statistical, Robustness, and Computational Guarantees for Sliced Wasserstein Distances

Arxiv

0+阅读 · 2022年10月17日

Neural Lyapunov Control of Unknown Nonlinear Systems with Stability Guarantees

Arxiv

0+阅读 · 2022年10月16日

Active Learning with Neural Networks: Insights from Nonparametric Statistics

Arxiv

0+阅读 · 2022年10月15日

Why Robust Generalization in Deep Learning is Difficult: Perspective of Expressive Power

Arxiv

0+阅读 · 2022年10月14日

Effective Class-Imbalance learning based on SMOTE and Convolutional Neural Networks

Arxiv

0+阅读 · 2022年10月13日

Generalization Bounds with Minimal Dependency on Hypothesis Class via Distributionally Robust Optimization

Arxiv

0+阅读 · 2022年10月12日

Scaling Properties of Deep Residual Networks

Arxiv

13+阅读 · 2021年5月25日

Faster Meta Update Strategy for Noise-Robust Deep Learning

Arxiv

11+阅读 · 2021年4月30日

Minimal Variance Sampling with Provable Guarantees for Fast Training of Graph Neural Networks

Minimal Variance Sampling with Provable Guarantees for Fast Training of Graph Neural Networks

Arxiv

13+阅读 · 2020年6月24日

Class-Balanced Loss Based on Effective Number of Samples

Arxiv

12+阅读 · 2019年1月16日

相关基金

Med25作为共激活因子对糖皮质激素受体GRα介导的CYP2C9的调控机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

拟南芥类受体蛋白激酶CRKN1在脱落酸信号转导中的功能研究

国家自然科学基金

0+阅读 · 2015年12月31日

IMP3调控上皮间质转化和肿瘤干细胞进而参与结肠癌发生和转移的机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

PPAR β/δ基因在结直肠癌血管生成调控中的作用及分子机理

国家自然科学基金

2+阅读 · 2014年12月31日

基于Exemplar-Classifier思想的高分辨率光学遥感影像目标识别研究

国家自然科学基金

2+阅读 · 2013年12月31日

基于上皮间充质转化和细胞外基质沉积研究人参皂甙Rg1对COPD发生发展的作用及机制

国家自然科学基金

0+阅读 · 2013年12月31日

Lee偏差在试验设计中的应用研究

国家自然科学基金

0+阅读 · 2013年12月31日

约束优化方法及其在图像恢复中的应用

国家自然科学基金

0+阅读 · 2012年12月31日

生物特征识别中高维数据的统计降维及算法研究

国家自然科学基金

0+阅读 · 2012年12月31日

GSK-3β调控血管平滑肌细胞特异性转录因子Myocardin对动脉粥样硬化斑块形成作用及分子机制

国家自然科学基金

0+阅读 · 2012年12月31日

微信扫码咨询专知VIP会员