With the increase in availability of large pre-trained language models (LMs) in Natural Language Processing (NLP), it becomes critical to assess their fit for a specific target task a priori, as fine-tuning the entire space of available LMs is computationally prohibitive and unsustainable. However, encoder transferability estimation has received little to no attention in NLP. In this paper, we propose to generate quantitative evidence to predict which LM, out of a pool of models, will perform best on a target task without having to fine-tune all candidates. We provide a comprehensive study on LM ranking for 10 NLP tasks spanning the two fundamental problem types of classification and structured prediction. We adopt the state-of-the-art Logarithm of Maximum Evidence (LogME) measure from Computer Vision (CV) and find that it positively correlates with final LM performance in 94% of the setups. In the first study of its kind, we further compare transferability measures with the de facto standard of human practitioner ranking, finding that evidence from quantitative metrics is more robust than pure intuition and can help identify unexpected LM candidates.
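For illustration, the following is a minimal sketch of how a LogME score could be computed for one candidate encoder and then used to rank a pool of models without fine-tuning. It assumes a `features` array of frozen-encoder representations with shape (N, D) and a `labels` array of N class ids; the function and variable names are illustrative, not taken from the paper, and the fixed-point update follows the procedure described by You et al. (2021), with per-class evidences averaged over one-vs-rest targets.

import numpy as np

def logme_score(features: np.ndarray, labels: np.ndarray) -> float:
    """Approximate LogME for a classification task; higher suggests a better fit."""
    f = features.astype(np.float64)                 # (N, D) frozen-encoder features
    N, D = f.shape
    # Eigendecomposition of f^T f yields the squared singular spectrum used below.
    v, s, vh = np.linalg.svd(f.T @ f, full_matrices=True)   # (D, D), (D,), (D, D)
    evidences = []
    for c in np.unique(labels):
        y = (labels == c).astype(np.float64)        # one-vs-rest target vector
        alpha, beta = 1.0, 1.0
        tmp = vh @ (f.T @ y)
        for _ in range(11):                         # fixed-point iteration on (alpha, beta)
            lam = alpha / beta
            gamma = (s / (s + lam)).sum()
            m = v @ (tmp * beta / (alpha + beta * s))   # posterior mean of linear weights
            alpha_de = (m * m).sum()
            beta_de = ((y - f @ m) ** 2).sum()
            alpha = gamma / (alpha_de + 1e-10)
            beta = (N - gamma) / (beta_de + 1e-10)
        # Log marginal evidence of the Bayesian linear model, normalized per example.
        evidence = (D / 2.0 * np.log(alpha)
                    + N / 2.0 * np.log(beta)
                    - 0.5 * np.sum(np.log(alpha + beta * s))
                    - beta / 2.0 * beta_de
                    - alpha / 2.0 * alpha_de
                    - N / 2.0 * np.log(2 * np.pi))
        evidences.append(evidence / N)
    return float(np.mean(evidences))

# Hypothetical usage: extract features once per candidate encoder, score, and rank.
# scores = {name: logme_score(extract_features(name, texts), labels) for name in candidates}
# ranking = sorted(scores, key=scores.get, reverse=True)

In this setup the only per-candidate cost is a single forward pass to extract features plus the closed-form evidence computation, which is what makes ranking a pool of LMs tractable compared with fine-tuning each one.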