变式快速调整改进了愿景语言模式的概括化 (Variational prompt tuning improves generalization of vision-language models) - 专知论文

会员服务 ·

0

泛化理论 · Prompt · MoDELS · tuning · 语言模型化 ·

2022 年 10 月 5 日

Variational prompt tuning improves generalization of vision-language models

翻译：变式快速调整改进了愿景语言模式的概括化

Mohammad Mahdi Derakhshani,Enrique Sanchez,Adrian Bulat,Victor Guilherme Turrisi da Costa,Cees G. M. Snoek,Georgios Tzimiropoulos,Brais Martinez

Prompt tuning provides an efficient mechanism to adapt large vision-language models to downstream tasks by treating part of the input language prompts as learnable parameters while freezing the rest of the model. Existing works for prompt tuning are however prone to damaging the generalization capabilities of the foundation models, because the learned prompts lack the capacity of covering certain concepts within the language model. To avoid such limitation, we propose a probabilistic modeling of the underlying distribution of prompts, allowing prompts within the support of an associated concept to be derived through stochastic sampling. This results in a more complete and richer transfer of the information captured by the language model, providing better generalization capabilities for downstream tasks. The resulting algorithm relies on a simple yet powerful variational framework that can be directly integrated with other developments. We show our approach is seamlessly integrated into both standard and conditional prompt learning frameworks, improving the performance on both cases considerably, especially with regards to preserving the generalization capability of the original model. Our method provides the current state-of-the-art for prompt learning, surpassing CoCoOp by 1.6% average Top-1 accuracy on the standard benchmark. Remarkably, it even surpasses the original CLIP model in terms of generalization to new classes. Implementation code will be released.

翻译：快速调整提供了一个有效的机制,使大型视觉语言模型适应下游任务,办法是将部分输入语言作为可学习的参数处理,同时冻结模型的其余部分内容。现有的快速调整工作容易破坏基础模型的普及能力,因为学习的迅速缺乏在语言模型中涵盖某些概念的能力。为了避免这种限制,我们建议对快速分配的基本分布进行概率模型,允许在支持相关概念的范围内通过随机抽样取出快速信号。这导致更完整和更丰富地传输语言模型收集的信息,为下游任务提供更好的概括化能力。由此产生的算法依赖于一个简单而有力的变异框架,可以直接与其他发展结合起来。我们表明,我们的方法无缝地融入了标准框架和有条件的即时学习框架,大大改进了这两个案例的绩效,特别是在维护原始模型的普及能力方面。我们的方法为迅速学习提供了当前的最新技术,在标准基准上超过了1.6%的平均顶层-1准确度,从而使得新的标准化模型的原始版本超越了CLIP。

0

相关内容

泛化理论

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

最新BERT相关论文清单，BERT-related Papers

最新BERT相关论文清单，BERT-related Papers

专知会员服务

53+阅读 · 2019年9月29日

征稿 | CFP：Special Issue of NLP and KG(JCR Q2，IF2.67)

征稿 | CFP：Special Issue of NLP and KG(JCR Q2，IF2.67)

开放知识图谱

1+阅读 · 2022年4月4日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

vae 相关论文表示学习 1

vae 相关论文表示学习 1

CreateAMind

12+阅读 · 2018年9月6日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

半导体/绝缘体高分子复合材料的中重度掺杂性质及应用

国家自然科学基金

0+阅读 · 2014年12月31日

多参数传热反问题的RBF-MLPG方法研究

国家自然科学基金

0+阅读 · 2014年12月31日

多组分复合屏蔽介质在水泥基材料中的作用机理研究

国家自然科学基金

0+阅读 · 2014年12月31日

新型FePt基纳米复合永磁材料的研究

国家自然科学基金

0+阅读 · 2013年12月31日

有机分子催化立体选择性反应合成含氟杂环的研究

国家自然科学基金

0+阅读 · 2012年12月31日

核壳发光材料中ZrO2/BN阻挡层的形成及其作用机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

LiMSO4F(M=Fe, Co, Ni)正极材料的设计与储锂机制及其载流子输运性能

国家自然科学基金

0+阅读 · 2012年12月31日

TiO2@PANI@纳米铁氧体光催化磁流体的制备及其光催化与磁回收性能研究

国家自然科学基金

0+阅读 · 2012年12月31日

中低温可调气氛预处理-高活性催化剂设计新思路

国家自然科学基金

0+阅读 · 2009年12月31日

EPO抑制创伤性脑水肿的分子机制

国家自然科学基金

0+阅读 · 2008年12月31日

Few-shot Learning with Multilingual Language Models

Arxiv

0+阅读 · 2022年11月10日

eDiffi: Text-to-Image Diffusion Models with an Ensemble of Expert Denoisers

Arxiv

0+阅读 · 2022年11月6日

Momentum-based Weight Interpolation of Strong Zero-Shot Models for Continual Learning

Arxiv

0+阅读 · 2022年11月6日

Tuning Language Models as Training Data Generators for Augmentation-Enhanced Few-Shot Learning

Arxiv

0+阅读 · 2022年11月6日

CPL: Counterfactual Prompt Learning for Vision and Language Models

Arxiv

0+阅读 · 2022年11月5日

The Benefits of Model-Based Generalization in Reinforcement Learning

Arxiv

0+阅读 · 2022年11月4日

Understanding and Mitigating Overfitting in Prompt Tuning for Vision-Language Models

Arxiv

0+阅读 · 2022年11月4日

Pre-training Methods in Information Retrieval

Arxiv

16+阅读 · 2021年11月27日

Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods in Natural Language Processing

Arxiv

30+阅读 · 2021年7月28日

Unsupervised Domain Clusters in Pretrained Language Models

Arxiv

11+阅读 · 2020年4月5日

VIP会员

文章信息

相关主题

语言模型化

相关VIP内容

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

最新BERT相关论文清单，BERT-related Papers

最新BERT相关论文清单，BERT-related Papers

专知会员服务

53+阅读 · 2019年9月29日

热门VIP内容

开通专知VIP会员享更多权益服务

《复杂工程系统模型驱动设计决策支持系统：早期设计阶段挑战》最新138页

《日本陆上自卫队2040年作战方式与未来作战研究》最新23页slides

人工智能作为战争武器

《后勤保障》最新23页

相关资讯

征稿 | CFP：Special Issue of NLP and KG(JCR Q2，IF2.67)

征稿 | CFP：Special Issue of NLP and KG(JCR Q2，IF2.67)

开放知识图谱

1+阅读 · 2022年4月4日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

vae 相关论文表示学习 1

vae 相关论文表示学习 1

CreateAMind

12+阅读 · 2018年9月6日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

相关论文

Few-shot Learning with Multilingual Language Models

Arxiv

0+阅读 · 2022年11月10日

eDiffi: Text-to-Image Diffusion Models with an Ensemble of Expert Denoisers

Arxiv

0+阅读 · 2022年11月6日

Momentum-based Weight Interpolation of Strong Zero-Shot Models for Continual Learning

Arxiv

0+阅读 · 2022年11月6日

Tuning Language Models as Training Data Generators for Augmentation-Enhanced Few-Shot Learning

Arxiv

0+阅读 · 2022年11月6日

CPL: Counterfactual Prompt Learning for Vision and Language Models

Arxiv

0+阅读 · 2022年11月5日

The Benefits of Model-Based Generalization in Reinforcement Learning

Arxiv

0+阅读 · 2022年11月4日

Understanding and Mitigating Overfitting in Prompt Tuning for Vision-Language Models

Arxiv

0+阅读 · 2022年11月4日

Pre-training Methods in Information Retrieval

Arxiv

16+阅读 · 2021年11月27日

Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods in Natural Language Processing

Arxiv

30+阅读 · 2021年7月28日

Unsupervised Domain Clusters in Pretrained Language Models

Arxiv

11+阅读 · 2020年4月5日

相关基金

半导体/绝缘体高分子复合材料的中重度掺杂性质及应用

国家自然科学基金

0+阅读 · 2014年12月31日

多参数传热反问题的RBF-MLPG方法研究

国家自然科学基金

0+阅读 · 2014年12月31日

多组分复合屏蔽介质在水泥基材料中的作用机理研究

国家自然科学基金

0+阅读 · 2014年12月31日

新型FePt基纳米复合永磁材料的研究

国家自然科学基金

0+阅读 · 2013年12月31日

有机分子催化立体选择性反应合成含氟杂环的研究

国家自然科学基金

0+阅读 · 2012年12月31日

核壳发光材料中ZrO2/BN阻挡层的形成及其作用机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

LiMSO4F(M=Fe, Co, Ni)正极材料的设计与储锂机制及其载流子输运性能

国家自然科学基金

0+阅读 · 2012年12月31日

TiO2@PANI@纳米铁氧体光催化磁流体的制备及其光催化与磁回收性能研究

国家自然科学基金

0+阅读 · 2012年12月31日

中低温可调气氛预处理-高活性催化剂设计新思路

国家自然科学基金

0+阅读 · 2009年12月31日

EPO抑制创伤性脑水肿的分子机制

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员