山脊回归时的防水层过深 (Benign overfitting in ridge regression) - 专知论文

会员服务 ·

0

岭回归 · 相互独立的 · 过拟合 · 情景 · 噪声 ·

2022 年 12 月 6 日

Benign overfitting in ridge regression

翻译：山脊回归时的防水层过深

A. Tsigler,P. L. Bartlett

from arxiv, 76 pages; completely rewrote the old version. Enhanced introduction, comparisons to other papers, and results on negative regularization

In many modern applications of deep learning the neural network has many more parameters than the data points used for its training. Motivated by those practices, a large body of recent theoretical research has been devoted to studying overparameterized models. One of the central phenomena in this regime is the ability of the model to interpolate noisy data, but still have test error lower than the amount of noise in that data. arXiv:1906.11300 characterized for which covariance structure of the data such a phenomenon can happen in linear regression if one considers the interpolating solution with minimum $\ell_2$-norm and the data has independent components: they gave a sharp bound on the variance term and showed that it can be small if and only if the data covariance has high effective rank in a subspace of small co-dimension. We strengthen and complete their results by eliminating the independence assumption and providing sharp bounds for the bias term. Thus, our results apply in a much more general setting than those of arXiv:1906.11300, e.g., kernel regression, and not only characterize how the noise is damped but also which part of the true signal is learned. Moreover, we extend the result to the setting of ridge regression, which allows us to explain another interesting phenomenon: we give general sufficient conditions under which the optimal regularization is negative.

翻译：在许多深层学习的现代应用中,神经网络的参数比其培训所用的数据点要多得多。在这些做法的推动下,最近大量理论研究都致力于研究过分分化模型。这个制度的一个中心现象是模型能够内插噪音数据,但是仍然有比数据中噪音量低的测试错误。 arxiv:1906.11300的特征是,这些数据的共变结构在线性回归中可以发生,如果考虑到最小值为$_2美元-诺尔姆和数据具有独立组成部分的内插解决方案:它们对差异术语作了鲜明的限定,并且表明只有数据共变异在小相混合的子空间中具有高度有效等级时,它才可能很小。我们通过消除独立假设和为偏差术语提供尖的界限来加强和完成它们的结果。因此,我们的结果在比arxiv:1906.11300的负面环境更普遍应用,例如,内核倒退,并且不仅说明我们如何用最精确的信号来解释我们如何在一般的回归中测深层次上,我们又能够解释另一个令人兴奋的结果。

0

相关内容

岭回归

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

专知会员服务

69+阅读 · 2021年3月27日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium5

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium5

中国图象图形学学会CSIG

1+阅读 · 2021年11月11日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

中国图象图形学学会CSIG

0+阅读 · 2021年11月9日

局部学习的特征选择：Local-Learning-Based Feature Selection

局部学习的特征选择：Local-Learning-Based Feature Selection

我爱读PAMI

14+阅读 · 2019年9月20日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【推荐】用Tensorflow理解LSTM

【推荐】用Tensorflow理解LSTM

机器学习研究会

36+阅读 · 2017年9月11日

支气管上皮细胞klotho表达在慢性阻塞性肺气肿形成中作用及机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

联合过表达CXCR4基因的骨髓间充质干细胞和骨髓内皮前体细胞促进小体积移植肝再生及其机制的研究

国家自然科学基金

0+阅读 · 2014年12月31日

骨髓微环境调控骨髓瘤细胞RANKL表达的作用机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

毛茛科中花瓣缺失的分子机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

猪繁殖与呼吸综合征病毒ORF1b影响其致病性的分子机制

国家自然科学基金

0+阅读 · 2013年12月31日

Find-me和Eat-me信号在NOD.H-2h4 小鼠自身免疫甲状腺炎发病机制中的作用

国家自然科学基金

0+阅读 · 2012年12月31日

具有双稳态特性的复合材料结构粘弹性模型与变形机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

MRTF-A调控CYR61介导间充质干细胞向内皮细胞分化的分子机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

猪繁殖与呼吸综合征病毒Nsp1蛋白的锌指结构影响机体I型干扰素信号通路的机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

深亚微米集成电路铜互连线的宏、微观织构研究

国家自然科学基金

0+阅读 · 2008年12月31日

Bandwidth Selection for Gaussian Kernel Ridge Regression via Jacobian Control

Arxiv

0+阅读 · 2023年2月8日

A Bipartite Ranking Approach to the Two-Sample Problem

Arxiv

0+阅读 · 2023年2月7日

Scalable inference in functional linear regression with streaming data

Arxiv

0+阅读 · 2023年2月5日

$\ell_1$-penalized Multinomial Regression: Estimation, inference, and prediction, with an application to risk factor identification for different dementia subtypes

Arxiv

0+阅读 · 2023年2月5日

A Simple Approach for Local and Global Variable Importance in Nonlinear Regression Models

Arxiv

0+阅读 · 2023年2月3日

Neural Networks for Symbolic Regression

Arxiv

0+阅读 · 2023年2月3日

Characterization and estimation of high dimensional sparse regression parameters under linear inequality constraints

Arxiv

0+阅读 · 2023年2月3日

PINN Training using Biobjective Optimization: The Trade-off between Data Loss and Residual Loss

Arxiv

0+阅读 · 2023年2月3日

Fast Feature Selection with Fairness Constraints

Arxiv

0+阅读 · 2023年2月3日

A Survey of Model Compression and Acceleration for Deep Neural Networks

Arxiv

66+阅读 · 2019年9月8日

VIP会员

文章信息

相关主题

相互独立的

相关VIP内容

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

专知会员服务

69+阅读 · 2021年3月27日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【CMU博士论文】以人为中心的强化学习

任务规划与地形分析：现代复杂环境作战导航体系

认知优势：人工智能在国家安全决策中的核心作用

大模型赋能的具身智能：决策与具身学习综述

相关资讯

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium5

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium5

中国图象图形学学会CSIG

1+阅读 · 2021年11月11日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

中国图象图形学学会CSIG

0+阅读 · 2021年11月9日

局部学习的特征选择：Local-Learning-Based Feature Selection

局部学习的特征选择：Local-Learning-Based Feature Selection

我爱读PAMI

14+阅读 · 2019年9月20日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【推荐】用Tensorflow理解LSTM

【推荐】用Tensorflow理解LSTM

机器学习研究会

36+阅读 · 2017年9月11日

相关论文

Bandwidth Selection for Gaussian Kernel Ridge Regression via Jacobian Control

Arxiv

0+阅读 · 2023年2月8日

A Bipartite Ranking Approach to the Two-Sample Problem

Arxiv

0+阅读 · 2023年2月7日

Scalable inference in functional linear regression with streaming data

Arxiv

0+阅读 · 2023年2月5日

$\ell_1$-penalized Multinomial Regression: Estimation, inference, and prediction, with an application to risk factor identification for different dementia subtypes

Arxiv

0+阅读 · 2023年2月5日

A Simple Approach for Local and Global Variable Importance in Nonlinear Regression Models

Arxiv

0+阅读 · 2023年2月3日

Neural Networks for Symbolic Regression

Arxiv

0+阅读 · 2023年2月3日

Characterization and estimation of high dimensional sparse regression parameters under linear inequality constraints

Arxiv

0+阅读 · 2023年2月3日

PINN Training using Biobjective Optimization: The Trade-off between Data Loss and Residual Loss

Arxiv

0+阅读 · 2023年2月3日

Fast Feature Selection with Fairness Constraints

Arxiv

0+阅读 · 2023年2月3日

A Survey of Model Compression and Acceleration for Deep Neural Networks

Arxiv

66+阅读 · 2019年9月8日

相关基金

支气管上皮细胞klotho表达在慢性阻塞性肺气肿形成中作用及机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

联合过表达CXCR4基因的骨髓间充质干细胞和骨髓内皮前体细胞促进小体积移植肝再生及其机制的研究

国家自然科学基金

0+阅读 · 2014年12月31日

骨髓微环境调控骨髓瘤细胞RANKL表达的作用机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

毛茛科中花瓣缺失的分子机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

猪繁殖与呼吸综合征病毒ORF1b影响其致病性的分子机制

国家自然科学基金

0+阅读 · 2013年12月31日

Find-me和Eat-me信号在NOD.H-2h4 小鼠自身免疫甲状腺炎发病机制中的作用

国家自然科学基金

0+阅读 · 2012年12月31日

具有双稳态特性的复合材料结构粘弹性模型与变形机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

MRTF-A调控CYR61介导间充质干细胞向内皮细胞分化的分子机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

猪繁殖与呼吸综合征病毒Nsp1蛋白的锌指结构影响机体I型干扰素信号通路的机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

深亚微米集成电路铜互连线的宏、微观织构研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员