MAPL: 几乎没有热速促动的视觉语言通用预培训模型的参数-有效适应</s> (MAPL: Parameter-Efficient Adaptation of Unimodal Pre-Trained Models for Vision-Language Few-Shot Prompting) - 专知论文

会员服务 ·

0

单峰值 · MoDELS · Extensibility · 小样本学习 · Prompt ·

2023 年 3 月 15 日

MAPL: Parameter-Efficient Adaptation of Unimodal Pre-Trained Models for Vision-Language Few-Shot Prompting

翻译：MAPL: 几乎没有热速促动的视觉语言通用预培训模型的参数-有效适应

Oscar Mañas,Pau Rodriguez,Saba Ahmadi,Aida Nematzadeh,Yash Goyal,Aishwarya Agrawal

from arxiv, Accepted at EACL 2023 (main track); 26 pages, 21 figures, 6 tables; Pau Rodriguez and Saba Ahmadi had equal contributions

Large pre-trained models have proved to be remarkable zero- and (prompt-based) few-shot learners in unimodal vision and language tasks. We propose MAPL, a simple and parameter-efficient method that reuses frozen pre-trained unimodal models and leverages their strong generalization capabilities in multimodal vision-language (VL) settings. MAPL learns a lightweight mapping between the representation spaces of unimodal models using aligned image-text data, and can generalize to unseen VL tasks from just a few in-context examples. The small number of trainable parameters makes MAPL effective at low-data and in-domain learning. Moreover, MAPL's modularity enables easy extension to other pre-trained models. Extensive experiments on several visual question answering and image captioning benchmarks show that MAPL achieves superior or competitive performance compared to similar methods while training orders of magnitude fewer parameters. MAPL can be trained in just a few hours using modest computational resources and public datasets. We release our code and pre-trained model weights at https://github.com/mair-lab/mapl.

翻译：经事先培训的大型模型在单一方式愿景和语言任务方面被证明是显著的零和(即时的)微小学习者。我们提出MAPL,这是一种简单和有参数效率的方法,可以重新使用经过事先培训的单一方式模型,并在多式愿景语言(VL)环境中利用其强大的一般化能力。MAPL学会了使用统一的图像文本数据在单一方式模型代表空间之间进行轻量的绘图,并且可以从仅有的几个文本中概括到看不见的VL任务。这些少量的可培训参数使得MAPL在低数据和日常学习方面有效。此外,MAPL的模块化使得很容易推广到其他经过培训的模型。关于几个视觉问题回答和图像描述基准的广泛实验表明,MAPL在类似方法下取得了优异性或竞争性的性能,而培训的参数则要小得多。使用少量的计算资源和公共数据集,可以进行短几个小时的培训。我们在 https://github.com/mair-lab/mapl上发布我们的代码和事先训练过的模型重量。我们公布在https://github.com/mair-lab/mapl。</s>

0

相关内容

单峰值

不可错过！首门《自监督学习统计模型》课程！霍普金斯Daniel Khashabi讲授

不可错过！首门《自监督学习统计模型》课程！霍普金斯Daniel Khashabi讲授

专知会员服务

24+阅读 · 2022年9月30日

自然语言处理顶会NAACL2022最佳论文出炉！

自然语言处理顶会NAACL2022最佳论文出炉！

专知会员服务

43+阅读 · 2022年6月30日

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

零样本文本分类，Zero-Shot Learning for Text Classification

零样本文本分类，Zero-Shot Learning for Text Classification

专知会员服务

97+阅读 · 2020年5月31日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

165+阅读 · 2020年3月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

征稿 | International Joint Conference on Knowledge Graphs (IJCKG)

征稿 | International Joint Conference on Knowledge Graphs (IJCKG)

开放知识图谱

2+阅读 · 2022年5月20日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

pytorch-pretrained-BERT：BERT PyTorch实现，可加载Google BERT预训练模型

pytorch-pretrained-BERT：BERT PyTorch实现，可加载Google BERT预训练模型

AINLP

35+阅读 · 2018年11月6日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

Poisson流形上的修正Hamilton方法

国家自然科学基金

0+阅读 · 2014年12月31日

臭氧光催化转化的基础研究

国家自然科学基金

0+阅读 · 2014年12月31日

IFN-γ通过EZH2介导lncRNA调控肝癌中枯否细胞表达Galectin-9的机制

国家自然科学基金

0+阅读 · 2013年12月31日

Ba基复合钙钛矿陶瓷的有序/无序相变、畴结构与微波介电性能

国家自然科学基金

0+阅读 · 2012年12月31日

高功率窄线宽中红外光参量技术研究

国家自然科学基金

0+阅读 · 2012年12月31日

新型的全固态Ho:BaY2F8中红外激光器动力学及输出特性研究

国家自然科学基金

0+阅读 · 2012年12月31日

具有未配位活性基团的多孔配位聚合物的设计及催化性能研究

国家自然科学基金

0+阅读 · 2012年12月31日

哮喘中T细胞活化衔接子对调节性T细胞调控研究

国家自然科学基金

0+阅读 · 2011年12月31日

Notch信号通路负性调控哮喘小鼠气道杯状细胞MUC5AC的合成及其机制的研究

国家自然科学基金

0+阅读 · 2009年12月31日

基于EMCCD的二维天文光子计数成像技术

国家自然科学基金

0+阅读 · 2009年12月31日

UNTER: A Unified Knowledge Interface for Enhancing Pre-trained Language Models

Arxiv

0+阅读 · 2023年5月5日

DiffFit: Unlocking Transferability of Large Diffusion Models via Simple Parameter-Efficient Fine-Tuning

Arxiv

0+阅读 · 2023年5月4日

Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller Model Sizes

Arxiv

22+阅读 · 2023年5月3日

Making the Most of What You Have: Adapting Pre-trained Visual Language Models in the Low-data Regime

Arxiv

0+阅读 · 2023年5月3日

The Benefits of Label-Description Training for Zero-Shot Text Classification

Arxiv

0+阅读 · 2023年5月3日

SLTUNET: A Simple Unified Model for Sign Language Translation

Arxiv

0+阅读 · 2023年5月2日

Multimodal Prompting with Missing Modalities for Visual Recognition

Arxiv

11+阅读 · 2023年3月6日

Conditional Prompt Learning for Vision-Language Models

Conditional Prompt Learning for Vision-Language Models

Arxiv

13+阅读 · 2022年3月10日

Making Pre-trained Language Models Better Few-shot Learners

Arxiv

14+阅读 · 2020年12月31日

Diverse Image-to-Image Translation via Disentangled Representations

Diverse Image-to-Image Translation via Disentangled Representations

Arxiv

13+阅读 · 2018年8月2日

VIP会员

文章信息

相关主题

小样本学习

相关VIP内容

不可错过！首门《自监督学习统计模型》课程！霍普金斯Daniel Khashabi讲授

不可错过！首门《自监督学习统计模型》课程！霍普金斯Daniel Khashabi讲授

专知会员服务

24+阅读 · 2022年9月30日

自然语言处理顶会NAACL2022最佳论文出炉！

自然语言处理顶会NAACL2022最佳论文出炉！

专知会员服务

43+阅读 · 2022年6月30日

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

零样本文本分类，Zero-Shot Learning for Text Classification

零样本文本分类，Zero-Shot Learning for Text Classification

专知会员服务

97+阅读 · 2020年5月31日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

165+阅读 · 2020年3月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《多智能体不确定环境追逃博弈研究》216页

美智库最新发布《解放军"人机编组协同作战"发展路径：理论与实践》53页

现代战争"杀伤区"理论：空间尺度与结构特征、控制手段与毁伤机制、生存策略与战线转移

《俄军无人机创新技术或已在乌克兰达成"战场空中封锁"作战效果》最新18页报告

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

征稿 | International Joint Conference on Knowledge Graphs (IJCKG)

征稿 | International Joint Conference on Knowledge Graphs (IJCKG)

开放知识图谱

2+阅读 · 2022年5月20日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

pytorch-pretrained-BERT：BERT PyTorch实现，可加载Google BERT预训练模型

pytorch-pretrained-BERT：BERT PyTorch实现，可加载Google BERT预训练模型

AINLP

35+阅读 · 2018年11月6日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

相关论文

UNTER: A Unified Knowledge Interface for Enhancing Pre-trained Language Models

Arxiv

0+阅读 · 2023年5月5日

DiffFit: Unlocking Transferability of Large Diffusion Models via Simple Parameter-Efficient Fine-Tuning

Arxiv

0+阅读 · 2023年5月4日

Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller Model Sizes

Arxiv

22+阅读 · 2023年5月3日

Making the Most of What You Have: Adapting Pre-trained Visual Language Models in the Low-data Regime

Arxiv

0+阅读 · 2023年5月3日

The Benefits of Label-Description Training for Zero-Shot Text Classification

Arxiv

0+阅读 · 2023年5月3日

SLTUNET: A Simple Unified Model for Sign Language Translation

Arxiv

0+阅读 · 2023年5月2日

Multimodal Prompting with Missing Modalities for Visual Recognition

Arxiv

11+阅读 · 2023年3月6日

Conditional Prompt Learning for Vision-Language Models

Conditional Prompt Learning for Vision-Language Models

Arxiv

13+阅读 · 2022年3月10日

Making Pre-trained Language Models Better Few-shot Learners

Arxiv

14+阅读 · 2020年12月31日

Diverse Image-to-Image Translation via Disentangled Representations

Diverse Image-to-Image Translation via Disentangled Representations

Arxiv

13+阅读 · 2018年8月2日

相关基金

Poisson流形上的修正Hamilton方法

国家自然科学基金

0+阅读 · 2014年12月31日

臭氧光催化转化的基础研究

国家自然科学基金

0+阅读 · 2014年12月31日

IFN-γ通过EZH2介导lncRNA调控肝癌中枯否细胞表达Galectin-9的机制

国家自然科学基金

0+阅读 · 2013年12月31日

Ba基复合钙钛矿陶瓷的有序/无序相变、畴结构与微波介电性能

国家自然科学基金

0+阅读 · 2012年12月31日

高功率窄线宽中红外光参量技术研究

国家自然科学基金

0+阅读 · 2012年12月31日

新型的全固态Ho:BaY2F8中红外激光器动力学及输出特性研究

国家自然科学基金

0+阅读 · 2012年12月31日

具有未配位活性基团的多孔配位聚合物的设计及催化性能研究

国家自然科学基金

0+阅读 · 2012年12月31日

哮喘中T细胞活化衔接子对调节性T细胞调控研究

国家自然科学基金

0+阅读 · 2011年12月31日

Notch信号通路负性调控哮喘小鼠气道杯状细胞MUC5AC的合成及其机制的研究

国家自然科学基金

0+阅读 · 2009年12月31日

基于EMCCD的二维天文光子计数成像技术

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员