高维回归的信息瓶颈理论:相关性、效率和最佳性 (Information bottleneck theory of high-dimensional regression: relevancy, efficiency and optimality) - 专知论文

会员服务 ·

0

INFORMS · 优化器 · Learning · 过拟合 · 线性的 ·

2022 年 8 月 8 日

Information bottleneck theory of high-dimensional regression: relevancy, efficiency and optimality

翻译：高维回归的信息瓶颈理论:相关性、效率和最佳性

Vudtiwat Ngampruetikorn,David J. Schwab

from arxiv, 11 pages, 4 figures

Avoiding overfitting is a central challenge in machine learning, yet many large neural networks readily achieve zero training loss. This puzzling contradiction necessitates new approaches to the study of overfitting. Here we quantify overfitting via residual information, defined as the bits in fitted models that encode noise in training data. Information efficient learning algorithms minimize residual information while maximizing the relevant bits, which are predictive of the unknown generative models. We solve this optimization to obtain the information content of optimal algorithms for a linear regression problem and compare it to that of randomized ridge regression. Our results demonstrate the fundamental tradeoff between residual and relevant information and characterize the relative information efficiency of randomized regression with respect to optimal algorithms. Finally, using results from random matrix theory, we reveal the information complexity of learning a linear map in high dimensions and unveil information-theoretic analogs of double and multiple descent phenomena.

翻译：避免超装是机器学习的一个中心挑战,但许多大型神经网络很容易实现零培训损失。这种令人费解的矛盾要求以新的方法来研究超装问题。我们在这里通过残余信息量化超装问题, 被定义为在培训数据中编码噪音的适合模型中的比特。信息高效的学习算法最大限度地减少残余信息, 同时又最大限度地扩大相关比特, 这些比特是未知的基因化模型的预测。我们解决了这一优化, 以获得线性回归问题最佳算法的信息内容, 并将其与随机的脊脊回归进行比较。我们的结果显示了剩余信息与相关信息之间的基本平衡, 并说明了随机回归在优化算法方面的相对信息效率。最后, 我们利用随机矩阵理论的结果, 揭示了在高维度上学习线性地图的信息复杂性, 并展示了双、多重血统现象的信息理论模拟。

0

相关内容

INFORMS

《计算机信息》杂志发表高质量的论文，扩大了运筹学和计算的范围，寻求有关理论、方法、实验、系统和应用方面的原创研究论文、新颖的调查和教程论文，以及描述新的和有用的软件工具的论文。官网链接：https://pubsonline.informs.org/journal/ijoc

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

专知会员服务

93+阅读 · 2020年2月12日

UC.Berkeley CS189讲义教材:《机器学习全面指南》，185页pdf

专知会员服务

162+阅读 · 2020年1月16日

论深度学习的信息瓶颈理论（On the information bottleneck theory of deep learning）

论深度学习的信息瓶颈理论（On the information bottleneck theory of deep learning）

专知会员服务

66+阅读 · 2019年12月20日

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

专知会员服务

15+阅读 · 2019年10月23日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

【ICIG2021】Latest News & Announcements of the Plenary Talk2

【ICIG2021】Latest News & Announcements of the Plenary Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年11月2日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

含功能性构筑单元扩展卟啉的合成与性质研究

国家自然科学基金

0+阅读 · 2015年12月31日

Anderson型多酸的不对称修饰及可控组装研究

国家自然科学基金

1+阅读 · 2014年12月31日

稀土MOF纳米荧光探针的设计合成及其生物应用

国家自然科学基金

0+阅读 · 2013年12月31日

基于羟肟酸配体的新型金属冠醚的设计合成、结构与性质研究

国家自然科学基金

0+阅读 · 2012年12月31日

多孔发光稀土金属-有机骨架材料的合成与药物缓释研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于多模态磁共振成像2型糖尿病脑病嗅觉与认知障碍的机制探讨与干预研究

国家自然科学基金

0+阅读 · 2012年12月31日

混合组织纳米结构金属材料强韧性能研究

国家自然科学基金

0+阅读 · 2011年12月31日

CIECAM02拓展研究

国家自然科学基金

0+阅读 · 2011年12月31日

以表面活性离子液体为模版制备介孔二氧化硅材料

国家自然科学基金

0+阅读 · 2009年12月31日

多功能有机膦酸盐的结构规律与性能

国家自然科学基金

0+阅读 · 2008年12月31日

Probabilistic partition of unity networks for high-dimensional regression problems

Arxiv

0+阅读 · 2022年10月6日

On the duality between contrastive and non-contrastive self-supervised learning

Arxiv

0+阅读 · 2022年10月5日

Statistical Inference for Genetic Relatedness Based on High-Dimensional Logistic Regression

Arxiv

0+阅读 · 2022年10月5日

Inference on High-dimensional Single-index Models with Streaming Data

Arxiv

0+阅读 · 2022年10月3日

Statistical Efficiency of Score Matching: The View from Isoperimetry

Arxiv

0+阅读 · 2022年10月3日

High-dimensional Censored Regression via the Penalized Tobit Likelihood

Arxiv

0+阅读 · 2022年10月3日

Inferring Manifolds From Noisy Data Using Gaussian Processes

Arxiv

0+阅读 · 2022年10月2日

Primal-dual regression approach for Markov decision processes with general state and action space

Arxiv

0+阅读 · 2022年10月1日

Low-rank Latent Matrix-factor Prediction Modeling for Generalized High-dimensional Matrix-variate Regression

Arxiv

0+阅读 · 2022年10月1日

The Principles of Deep Learning Theory

Arxiv

65+阅读 · 2021年6月18日

VIP会员

文章信息

相关主题

相关VIP内容

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

专知会员服务

93+阅读 · 2020年2月12日

UC.Berkeley CS189讲义教材:《机器学习全面指南》，185页pdf

专知会员服务

162+阅读 · 2020年1月16日

论深度学习的信息瓶颈理论（On the information bottleneck theory of deep learning）

论深度学习的信息瓶颈理论（On the information bottleneck theory of deep learning）

专知会员服务

66+阅读 · 2019年12月20日

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

专知会员服务

15+阅读 · 2019年10月23日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《步兵小单元山地严寒作战指南》美军最新条令200页

《联合作战概念的发展》最新报告

俄制无人机弹药

《复杂场景下自主着陆的模型预测控制技术》92页

相关资讯

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

【ICIG2021】Latest News & Announcements of the Plenary Talk2

【ICIG2021】Latest News & Announcements of the Plenary Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年11月2日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

相关论文

Probabilistic partition of unity networks for high-dimensional regression problems

Arxiv

0+阅读 · 2022年10月6日

On the duality between contrastive and non-contrastive self-supervised learning

Arxiv

0+阅读 · 2022年10月5日

Statistical Inference for Genetic Relatedness Based on High-Dimensional Logistic Regression

Arxiv

0+阅读 · 2022年10月5日

Inference on High-dimensional Single-index Models with Streaming Data

Arxiv

0+阅读 · 2022年10月3日

Statistical Efficiency of Score Matching: The View from Isoperimetry

Arxiv

0+阅读 · 2022年10月3日

High-dimensional Censored Regression via the Penalized Tobit Likelihood

Arxiv

0+阅读 · 2022年10月3日

Inferring Manifolds From Noisy Data Using Gaussian Processes

Arxiv

0+阅读 · 2022年10月2日

Primal-dual regression approach for Markov decision processes with general state and action space

Arxiv

0+阅读 · 2022年10月1日

Low-rank Latent Matrix-factor Prediction Modeling for Generalized High-dimensional Matrix-variate Regression

Arxiv

0+阅读 · 2022年10月1日

The Principles of Deep Learning Theory

Arxiv

65+阅读 · 2021年6月18日

相关基金

含功能性构筑单元扩展卟啉的合成与性质研究

国家自然科学基金

0+阅读 · 2015年12月31日

Anderson型多酸的不对称修饰及可控组装研究

国家自然科学基金

1+阅读 · 2014年12月31日

稀土MOF纳米荧光探针的设计合成及其生物应用

国家自然科学基金

0+阅读 · 2013年12月31日

基于羟肟酸配体的新型金属冠醚的设计合成、结构与性质研究

国家自然科学基金

0+阅读 · 2012年12月31日

多孔发光稀土金属-有机骨架材料的合成与药物缓释研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于多模态磁共振成像2型糖尿病脑病嗅觉与认知障碍的机制探讨与干预研究

国家自然科学基金

0+阅读 · 2012年12月31日

混合组织纳米结构金属材料强韧性能研究

国家自然科学基金

0+阅读 · 2011年12月31日

CIECAM02拓展研究

国家自然科学基金

0+阅读 · 2011年12月31日

以表面活性离子液体为模版制备介孔二氧化硅材料

国家自然科学基金

0+阅读 · 2009年12月31日

多功能有机膦酸盐的结构规律与性能

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员