NeurIPS 2023教程: 在超参数化模型时代重新考虑过拟合，231页ppt - 专知VIP

会员服务 ·

26

NeurIPS 2023 · 超参数调优 · 神经网络 · 过拟合 ·

2023 年 12 月 13 日

NeurIPS 2023教程: 在超参数化模型时代重新考虑过拟合，231页ppt

专知会员服务

专知，提供专业可信的知识分发服务，让认知协作更快更好！

大型的、过度参数化的模型(如神经网络)现在是现代机器学习的主力。这些模型通常在有噪声的数据集上训练到接近于零的误差，同时很好地泛化到未见过的数据，这与教科书中关于过拟合风险的直觉形成了对比。与此同时，近乎完美的数据拟合可能在鲁棒性、隐私和公平性的背景下存在严重的问题。由于过度参数化，经典的理论框架几乎没有为导航这些问题提供指导。因此，发展关于过拟合和泛化的新直觉至关重要，这些直觉反映了这些经验观察。在本教程中，我们将讨论学习理论文献中的最新工作，这些工作为这些现象提供了理论见解。参考文献： * Hastie, Trevor and Montanari, Andrea and Rosset, Saharon and Tibshirani, Ryan J (2022). Surprises in high-dimensional ridgeless least squares interpolation. Annals of Statistics. * Bartlett, Peter L and Long, Philip M and Lugosi, Gabor and Tsigler, Alexander (2020). Benign overfitting in linear regression. PNAS. * Muthukumar, Vidya and Vodrahalli, Kailas and Subramanian, Vignesh and Sahai, Anant (2020). Harmless interpolation of noisy data in regression. IEEE Journal on Selected Areas in Information Theory. * Wang, Guillaume and Donhauser, Konstantin and Yang, Fanny (2022). Tight bounds for minimum ℓ1-norm interpolation of noisy data. In: AISTATS. * Donhauser, Konstantin and Ruggeri, Nicolo and Stojanovic, Stefan and Yang, Fanny (2022). Fast rates for noisy interpolation require rethinking the effects of inductive bias. In: ICML. * Hsu, Daniel and Muthukumar, Vidya and Xu, Ji (2021). On the proliferation of support vectors in high dimensions. In: AISTATS. * Muthukumar, Vidya and Narang, Adhyyan and Subramanian, Vignesh and Belkin, Mikhail and Hsu, Daniel and Sahai, Anant (2021). Classification vs regression in overparameterized regimes: Does the loss function matter?. Journal of Machine Learning Research. * Frei, Spencer and Vardi, Gal and Bartlett, Peter and Srebro, Nathan and Hu, Wei (2023). Implicit Bias in Leaky ReLU Networks Trained on High-Dimensional Data. In: ICLR. * Frei, Spencer and Vardi, Gal and L., Peter and Srebro, Nathan (2023). Benign Overfitting in Linear Classifiers and Leaky ReLU Networks from KKT Conditions for Margin Maximization. In: COLT. * Xu, Xingyu and Gu, Yuantao (2023). Benign overfitting of non-smooth neural networks beyond lazy training. In: AISTATS.

成为VIP会员查看完整内容

49

相关内容

NeurIPS 2023

【NeurIPS 2023教程】隐扩散模型:生成式AI革命正在隐空间中发生吗?，133页ppt

【NeurIPS 2023教程】隐扩散模型:生成式AI革命正在隐空间中发生吗?，133页ppt

专知会员服务

54+阅读 · 2023年12月15日

【MIT博士论文】保证性生成模型，155页pdf

【MIT博士论文】保证性生成模型，155页pdf

专知会员服务

31+阅读 · 2023年8月8日

【ACL2023】从语言模型生成文本,140页ppt

【ACL2023】从语言模型生成文本,140页ppt

专知会员服务

45+阅读 · 2023年7月10日

【ECCV2022教程】基于自动驾驶数据的自监督学习研究进展，220页ppt

【ECCV2022教程】基于自动驾驶数据的自监督学习研究进展，220页ppt

专知会员服务

30+阅读 · 2022年10月26日

【DeepMind】结构化数据少样本学习，51页ppt

【DeepMind】结构化数据少样本学习，51页ppt

专知会员服务

34+阅读 · 2022年8月13日

【ICML2022教程】效度，可靠性和意义:可复现机器学习的统计方法教程，147页ppt

【ICML2022教程】效度，可靠性和意义:可复现机器学习的统计方法教程，147页ppt

专知会员服务

16+阅读 · 2022年7月20日

【伦敦大学皇家霍洛威学院博士论文】机器学习概率预测，171页pdf

【伦敦大学皇家霍洛威学院博士论文】机器学习概率预测，171页pdf

专知会员服务

35+阅读 · 2022年6月26日

【AAAI2021】知识迁移的机器学习成员隐私保护，57页ppt

【AAAI2021】知识迁移的机器学习成员隐私保护，57页ppt

专知会员服务

28+阅读 · 2021年2月9日

【IJCAI】大规模可扩展深度学习，82页ppt

【IJCAI】大规模可扩展深度学习，82页ppt

专知会员服务

29+阅读 · 2021年1月10日

【ECAI2020】可扩展深度学习: 理论与算法，120页ppt

【ECAI2020】可扩展深度学习: 理论与算法，120页ppt

专知会员服务

28+阅读 · 2020年9月25日

【佐治亚理工博士论文】基于策略智能体和有限反馈的序列决策，211页pdf

【佐治亚理工博士论文】基于策略智能体和有限反馈的序列决策，211页pdf

专知

38+阅读 · 2023年4月13日

【斯坦福博士论文】深度学习核编译为局部感知数据流，109页pdf

【斯坦福博士论文】深度学习核编译为局部感知数据流，109页pdf

专知

5+阅读 · 2023年4月5日

【硬核书】数据科学，282页pdf

【硬核书】数据科学，282页pdf

专知

26+阅读 · 2022年11月29日

【斯坦福博士论文】利用先验知识和结构进行数据高效的机器学习，154页pdf

【斯坦福博士论文】利用先验知识和结构进行数据高效的机器学习，154页pdf

专知

28+阅读 · 2022年9月11日

【2022新书】联邦学习：方法和应用的综合概述，531页pdf

【2022新书】联邦学习：方法和应用的综合概述，531页pdf

专知

27+阅读 · 2022年7月14日

【2022新书】知识表示和机器学习的预测和分析，232页pdf

【2022新书】知识表示和机器学习的预测和分析，232页pdf

专知

41+阅读 · 2022年3月12日

【KDD2020】图神经网络:基础与应用，322页ppt

【KDD2020】图神经网络:基础与应用，322页ppt

专知

35+阅读 · 2020年8月29日

【KDD2020-Tutorial】深度学习异常检测，180页ppt

【KDD2020-Tutorial】深度学习异常检测，180页ppt

专知

49+阅读 · 2020年8月28日

最新《可解释深度学习XDL》2020研究进展综述大全，54页pdf

最新《可解释深度学习XDL》2020研究进展综述大全，54页pdf

专知

36+阅读 · 2020年5月2日

元迁移学习的小样本学习，Meta-transfer Learning for Few-shot Learning，33页ppt

元迁移学习的小样本学习，Meta-transfer Learning for Few-shot Learning，33页ppt

专知

71+阅读 · 2020年2月29日

结合知识图谱的概率话题模型研究

国家自然科学基金

10+阅读 · 2015年12月31日

分布式有监督学习的学习理论

国家自然科学基金

17+阅读 · 2015年12月31日

基于对称识别方法的贝叶斯probit模型稳健性研究

国家自然科学基金

3+阅读 · 2015年12月31日

高维回归模型的预测稳定性研究

国家自然科学基金

3+阅读 · 2015年12月31日

基于多样化查询的多标记主动学习研究

国家自然科学基金

0+阅读 · 2015年12月31日

不确定知识图谱中面向结构查询的众包清洗研究

国家自然科学基金

4+阅读 · 2015年12月31日

基于支撑函数的不规则形态扩展目标建模和估计研究

国家自然科学基金

0+阅读 · 2015年12月31日

试验设计中的模型选择

国家自然科学基金

6+阅读 · 2014年12月31日

大数据环境下基于GMDH的客户分类半监督集成模型研究

国家自然科学基金

1+阅读 · 2014年12月31日

变换结构方程模型的非参数贝叶斯分析

国家自然科学基金

3+阅读 · 2014年12月31日

Deep graph kernel point processes

Deep graph kernel point processes

Arxiv

0+阅读 · 2024年2月2日

Is ChatGPT a Good Recommender? A Preliminary Study

Arxiv

171+阅读 · 2023年4月20日

NeuralField-LDM: Scene Generation with Hierarchical Latent Diffusion Models

Arxiv

42+阅读 · 2023年4月19日

A Comprehensive Survey on Deep Graph Representation Learning

Arxiv

103+阅读 · 2023年4月11日

On Efficient Training of Large-Scale Deep Learning Models: A Literature Review

Arxiv

216+阅读 · 2023年4月7日

A Survey of Large Language Models

A Survey of Large Language Models

Arxiv

478+阅读 · 2023年3月31日

Knowledge Graphs: Opportunities and Challenges

Arxiv

174+阅读 · 2023年3月24日

Sparks of Artificial General Intelligence: Early experiments with GPT-4

Arxiv

51+阅读 · 2023年3月22日

A Complete Survey on Generative AI (AIGC): Is ChatGPT from GPT-4 to GPT-5 All You Need?

Arxiv

84+阅读 · 2023年3月21日

Data-centric Artificial Intelligence: A Survey

Arxiv

24+阅读 · 2023年3月17日

VIP会员

相关主题

超参数调优

相关VIP内容

【NeurIPS 2023教程】隐扩散模型:生成式AI革命正在隐空间中发生吗?，133页ppt

【NeurIPS 2023教程】隐扩散模型:生成式AI革命正在隐空间中发生吗?，133页ppt

专知会员服务

54+阅读 · 2023年12月15日

【MIT博士论文】保证性生成模型，155页pdf

【MIT博士论文】保证性生成模型，155页pdf

专知会员服务

31+阅读 · 2023年8月8日

【ACL2023】从语言模型生成文本,140页ppt

【ACL2023】从语言模型生成文本,140页ppt

专知会员服务

45+阅读 · 2023年7月10日

【ECCV2022教程】基于自动驾驶数据的自监督学习研究进展，220页ppt

【ECCV2022教程】基于自动驾驶数据的自监督学习研究进展，220页ppt

专知会员服务

30+阅读 · 2022年10月26日

【DeepMind】结构化数据少样本学习，51页ppt

【DeepMind】结构化数据少样本学习，51页ppt

专知会员服务

34+阅读 · 2022年8月13日

【ICML2022教程】效度，可靠性和意义:可复现机器学习的统计方法教程，147页ppt

【ICML2022教程】效度，可靠性和意义:可复现机器学习的统计方法教程，147页ppt

专知会员服务

16+阅读 · 2022年7月20日

【伦敦大学皇家霍洛威学院博士论文】机器学习概率预测，171页pdf

【伦敦大学皇家霍洛威学院博士论文】机器学习概率预测，171页pdf

专知会员服务

35+阅读 · 2022年6月26日

【AAAI2021】知识迁移的机器学习成员隐私保护，57页ppt

【AAAI2021】知识迁移的机器学习成员隐私保护，57页ppt

专知会员服务

28+阅读 · 2021年2月9日

【IJCAI】大规模可扩展深度学习，82页ppt

【IJCAI】大规模可扩展深度学习，82页ppt

专知会员服务

29+阅读 · 2021年1月10日

【ECAI2020】可扩展深度学习: 理论与算法，120页ppt

【ECAI2020】可扩展深度学习: 理论与算法，120页ppt

专知会员服务

28+阅读 · 2020年9月25日

热门VIP内容

开通专知VIP会员享更多权益服务

《复杂工程系统模型驱动设计决策支持系统：早期设计阶段挑战》最新138页

《日本陆上自卫队2040年作战方式与未来作战研究》最新23页slides

人工智能作为战争武器

《后勤保障》最新23页

相关资讯

【佐治亚理工博士论文】基于策略智能体和有限反馈的序列决策，211页pdf

【佐治亚理工博士论文】基于策略智能体和有限反馈的序列决策，211页pdf

专知

38+阅读 · 2023年4月13日

【斯坦福博士论文】深度学习核编译为局部感知数据流，109页pdf

【斯坦福博士论文】深度学习核编译为局部感知数据流，109页pdf

专知

5+阅读 · 2023年4月5日

【硬核书】数据科学，282页pdf

【硬核书】数据科学，282页pdf

专知

26+阅读 · 2022年11月29日

【斯坦福博士论文】利用先验知识和结构进行数据高效的机器学习，154页pdf

【斯坦福博士论文】利用先验知识和结构进行数据高效的机器学习，154页pdf

专知

28+阅读 · 2022年9月11日

【2022新书】联邦学习：方法和应用的综合概述，531页pdf

【2022新书】联邦学习：方法和应用的综合概述，531页pdf

专知

27+阅读 · 2022年7月14日

【2022新书】知识表示和机器学习的预测和分析，232页pdf

【2022新书】知识表示和机器学习的预测和分析，232页pdf

专知

41+阅读 · 2022年3月12日

【KDD2020】图神经网络:基础与应用，322页ppt

【KDD2020】图神经网络:基础与应用，322页ppt

专知

35+阅读 · 2020年8月29日

【KDD2020-Tutorial】深度学习异常检测，180页ppt

【KDD2020-Tutorial】深度学习异常检测，180页ppt

专知

49+阅读 · 2020年8月28日

最新《可解释深度学习XDL》2020研究进展综述大全，54页pdf

最新《可解释深度学习XDL》2020研究进展综述大全，54页pdf

专知

36+阅读 · 2020年5月2日

元迁移学习的小样本学习，Meta-transfer Learning for Few-shot Learning，33页ppt

元迁移学习的小样本学习，Meta-transfer Learning for Few-shot Learning，33页ppt

专知

71+阅读 · 2020年2月29日

相关基金

结合知识图谱的概率话题模型研究

国家自然科学基金

10+阅读 · 2015年12月31日

分布式有监督学习的学习理论

国家自然科学基金

17+阅读 · 2015年12月31日

基于对称识别方法的贝叶斯probit模型稳健性研究

国家自然科学基金

3+阅读 · 2015年12月31日

高维回归模型的预测稳定性研究

国家自然科学基金

3+阅读 · 2015年12月31日

基于多样化查询的多标记主动学习研究

国家自然科学基金

0+阅读 · 2015年12月31日

不确定知识图谱中面向结构查询的众包清洗研究

国家自然科学基金

4+阅读 · 2015年12月31日

基于支撑函数的不规则形态扩展目标建模和估计研究

国家自然科学基金

0+阅读 · 2015年12月31日

试验设计中的模型选择

国家自然科学基金

6+阅读 · 2014年12月31日

大数据环境下基于GMDH的客户分类半监督集成模型研究

国家自然科学基金

1+阅读 · 2014年12月31日

变换结构方程模型的非参数贝叶斯分析

国家自然科学基金

3+阅读 · 2014年12月31日

相关论文

Deep graph kernel point processes

Deep graph kernel point processes

Arxiv

0+阅读 · 2024年2月2日

Is ChatGPT a Good Recommender? A Preliminary Study

Arxiv

171+阅读 · 2023年4月20日

NeuralField-LDM: Scene Generation with Hierarchical Latent Diffusion Models

Arxiv

42+阅读 · 2023年4月19日

A Comprehensive Survey on Deep Graph Representation Learning

Arxiv

103+阅读 · 2023年4月11日

On Efficient Training of Large-Scale Deep Learning Models: A Literature Review

Arxiv

216+阅读 · 2023年4月7日

A Survey of Large Language Models

A Survey of Large Language Models

Arxiv

478+阅读 · 2023年3月31日

Knowledge Graphs: Opportunities and Challenges

Arxiv

174+阅读 · 2023年3月24日

Sparks of Artificial General Intelligence: Early experiments with GPT-4

Arxiv

51+阅读 · 2023年3月22日

A Complete Survey on Generative AI (AIGC): Is ChatGPT from GPT-4 to GPT-5 All You Need?

Arxiv

84+阅读 · 2023年3月21日

Data-centric Artificial Intelligence: A Survey

Arxiv

24+阅读 · 2023年3月17日

微信扫码咨询专知VIP会员