A significant obstacle in the development of robust machine learning models is covariate shift, a form of distribution shift that occurs when the input distributions of the training and test sets differ while the conditional label distributions remain the same. Despite the prevalence of covariate shift in real-world applications, a theoretical understanding in the context of modern machine learning has remained lacking. In this work, we examine the exact high-dimensional asymptotics of random feature regression under covariate shift and present a precise characterization of the limiting test error, bias, and variance in this setting. Our results motivate a natural partial order over covariate shifts that provides a sufficient condition for determining when the shift will harm (or even help) test performance. We find that overparameterized models exhibit enhanced robustness to covariate shift, providing one of the first theoretical explanations for this intriguing phenomenon. Additionally, our analysis reveals an exact linear relationship between in-distribution and out-of-distribution generalization performance, offering an explanation for this surprising recent empirical observation.