The Power and Limitation of Pretraining-Finetuning for Linear Regression under Covariate Shift - Zhuanzhi Papers


Covariate shift · Linear regression · Learning · Linear · Reducible

August 3, 2022

The Power and Limitation of Pretraining-Finetuning for Linear Regression under Covariate Shift


Jingfeng Wu, Difan Zou, Vladimir Braverman, Quanquan Gu, Sham M. Kakade

From arXiv, 32 pages, 1 figure, 1 table

We study linear regression under covariate shift, where the marginal distribution over the input covariates differs in the source and the target domains, while the conditional distribution of the output given the input covariates is similar across the two domains. We investigate a transfer learning approach with pretraining on the source data and finetuning based on the target data (both conducted by online SGD) for this problem. We establish sharp instance-dependent excess risk upper and lower bounds for this approach. Our bounds suggest that for a large class of linear regression instances, transfer learning with $O(N^2)$ source data (and scarce or no target data) is as effective as supervised learning with $N$ target data. In addition, we show that finetuning, even with only a small amount of target data, could drastically reduce the amount of source data required by pretraining. Our theory sheds light on the effectiveness and limitation of pretraining as well as the benefits of finetuning for tackling covariate shift problems.
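The setup described in the abstract can be illustrated with a small simulation. The following is only a minimal sketch under stated assumptions, not the authors' experimental code: the Gaussian covariates, the diagonal source and target covariances with reversed spectra, the constant step size, the dimension, the sample sizes, and all function names (`online_sgd`, `target_excess_risk`, `run_demo`) are hypothetical choices made for illustration. The abstract does not specify the step-size schedule, so a constant one is assumed here.

```python
import numpy as np


def online_sgd(w_init, eigs, w_star, n_steps, lr, noise_std, rng):
    """Run one-pass (online) SGD on the squared loss, one fresh sample per step.

    Assumed data model: x ~ N(0, diag(eigs)), y = <x, w_star> + Gaussian noise.
    """
    w = w_init.copy()
    scale = np.sqrt(eigs)
    for _ in range(n_steps):
        x = scale * rng.standard_normal(w.shape[0])
        y = x @ w_star + noise_std * rng.standard_normal()
        w -= lr * (x @ w - y) * x  # gradient of 0.5 * (x @ w - y) ** 2
    return w


def target_excess_risk(w, w_star, target_eigs):
    """Target-domain excess risk (w - w_star)^T H_target (w - w_star),
    up to the constant factor fixed by the loss convention."""
    diff = w - w_star
    return float(np.sum(target_eigs * diff ** 2))


def run_demo(d=20, n_source=2000, n_target=50, lr=0.05, noise_std=0.1, seed=0):
    rng = np.random.default_rng(seed)
    # Covariate shift: same conditional model (w_star), different input covariance.
    source_eigs = 1.0 / np.arange(1, d + 1)
    target_eigs = source_eigs[::-1].copy()
    w_star = np.ones(d)
    w0 = np.zeros(d)

    # Pretrain on plentiful source data, then finetune on scarce target data.
    w_pre = online_sgd(w0, source_eigs, w_star, n_source, lr, noise_std, rng)
    w_ft = online_sgd(w_pre, target_eigs, w_star, n_target, lr, noise_std, rng)
    # Baseline: supervised learning on the scarce target data only.
    w_tgt = online_sgd(w0, target_eigs, w_star, n_target, lr, noise_std, rng)

    return {
        "no_transfer_init": target_excess_risk(w0, w_star, target_eigs),
        "target_only": target_excess_risk(w_tgt, w_star, target_eigs),
        "pretrain_only": target_excess_risk(w_pre, w_star, target_eigs),
        "pretrain_finetune": target_excess_risk(w_ft, w_star, target_eigs),
    }


if __name__ == "__main__":
    for name, risk in run_demo().items():
        print(f"{name}: {risk:.5f}")
```

Running the script prints the target-domain excess risk for each strategy. Varying `n_source` and `n_target` gives a rough feel for the trade-off between source and target data that the paper analyzes, though a toy run like this says nothing about the sharpness of the stated bounds.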


