The lottery ticket hypothesis (LTH) has attracted attention because it can explain why over-parameterized models often generalize well. It is known that iterative magnitude pruning (IMP), an algorithm for finding sparse subnetworks, called winning tickets, that can be trained to high accuracy starting from their original initial weights, does not work well with a large initial learning rate in deep neural networks such as ResNet. However, since a large initial learning rate generally helps the optimizer converge to flatter minima, we hypothesize that winning tickets have relatively sharp minima, which is usually considered a disadvantage for generalization. In this paper, we confirm this hypothesis and show that PAC-Bayesian theory can provide an explicit understanding of the relationship between the LTH and generalization behavior. On the basis of our experimental findings that flatness improves accuracy and robustness to label noise and that the distance from the initial weights is deeply involved in winning tickets, we offer a PAC-Bayes bound using a spike-and-slab distribution to analyze winning tickets. Finally, we revisit existing algorithms for finding winning tickets from a PAC-Bayesian perspective and provide new insights into these methods.
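To make the IMP procedure referenced above concrete, the following is a minimal sketch of iterative magnitude pruning with weight rewinding, not the paper's exact implementation: the `train` callback, the pruning rate, and the number of rounds are illustrative assumptions supplied by the reader.

```python
import numpy as np

def imp(init_weights, train, prune_rate=0.2, rounds=5):
    """Sketch of iterative magnitude pruning (IMP).

    init_weights: flat array of initial weights (the candidate "ticket").
    train: hypothetical user-supplied routine train(weights, mask) -> trained weights.
    Returns a binary mask and the rewound (masked) initial weights.
    """
    mask = np.ones_like(init_weights)
    weights = init_weights.copy()
    for _ in range(rounds):
        # Train the currently unpruned subnetwork.
        weights = train(weights * mask, mask)
        # Prune the smallest-magnitude surviving weights.
        surviving = np.abs(weights[mask == 1])
        threshold = np.quantile(surviving, prune_rate)
        mask = np.where(np.abs(weights) * mask > threshold, 1.0, 0.0)
        # Rewind the remaining weights to their initial values.
        weights = init_weights.copy()
    return mask, init_weights * mask
```

The rewinding step is what distinguishes a winning ticket from an ordinary pruned network: the surviving connections are reset to their initialization and must reach comparable accuracy when retrained in isolation.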