The vulnerability of machine learning models to spurious correlations has mostly been discussed in the context of supervised learning (SL). However, there is a lack of insight into how spurious correlations affect the performance of popular self-supervised learning (SSL) and auto-encoder-based (AE) models. In this work, we shed light on this by evaluating the performance of these models on both real-world and synthetic distribution shift datasets. Following observations that the linear head itself can be susceptible to spurious correlations, we develop a novel evaluation scheme in which the linear head is trained on out-of-distribution (OOD) data, isolating the performance of the pre-trained models from any bias of the linear head used for evaluation. With this new methodology, we show that SSL models are consistently more robust to distribution shifts, and thus better at OOD generalisation, than AE and SL models.
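To make the evaluation scheme concrete, the following is a minimal sketch of an OOD linear probe: a frozen pre-trained encoder produces features, and the linear head is fit on OOD data so the probe itself cannot inherit an in-distribution shortcut. All names here (`encoder`, `X_ood_train`, `y_ood_train`, etc.) are hypothetical illustrations under the assumption that features are exposed as NumPy-compatible arrays; this is not the paper's actual code.

```python
# Sketch of the OOD linear-head evaluation (assumed interface, not the
# authors' implementation): the encoder is frozen, only the head is fit.
from sklearn.linear_model import LogisticRegression

def ood_linear_probe(encoder, X_ood_train, y_ood_train, X_ood_test, y_ood_test):
    """Train a linear head on OOD features and report OOD test accuracy.

    Because the head never sees in-distribution data, it cannot re-learn
    the spurious correlation; differences in the returned score reflect
    the quality of the pre-trained representation, not a biased probe.
    """
    Z_train = encoder(X_ood_train)  # frozen features; no gradient updates
    Z_test = encoder(X_ood_test)
    head = LogisticRegression(max_iter=1000).fit(Z_train, y_ood_train)
    return head.score(Z_test, y_ood_test)
```

Training the head only on OOD examples is the key design choice: with a standard in-distribution probe, a low OOD score could be blamed on the head exploiting the shortcut, whereas here the score isolates the representation itself.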