With the emergence of large pre-trained vision-language models such as CLIP, transferable representations can be adapted to a wide range of downstream tasks via prompt tuning. Prompt tuning attempts to probe the information beneficial to downstream tasks from the general knowledge stored in both the image and text encoders of the pre-trained vision-language model. A recently proposed method named Context Optimization (CoOp) introduces a set of learnable vectors as the text prompt on the language side; however, tuning the text prompt alone cannot affect the visual features computed by the image encoder, leading to sub-optimal performance. In this paper, we propose a dual-modality prompt tuning paradigm that learns text prompts and visual prompts for the text and image encoders simultaneously. In addition, to make the visual prompt concentrate more on the target visual concept, we propose Class-Aware Visual Prompt Tuning (CAVPT), in which the class-aware visual prompt is generated dynamically by performing cross attention between the language descriptions of template prompts and the visual class token embeddings. Our method provides a new paradigm for tuning large pre-trained vision-language models, and extensive experimental results on 8 datasets demonstrate the effectiveness of the proposed method. Our code is available in the supplementary materials.
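The class-aware visual prompt described above can be illustrated with a minimal cross-attention sketch. This is not the paper's implementation: the function name, shapes, and single-head formulation are illustrative assumptions; only the overall pattern (text prompt embeddings as queries attending over visual class token embeddings) follows the description in the abstract.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax over the given axis.
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def class_aware_visual_prompt(text_emb, class_tokens):
    """Hypothetical sketch of CAVPT-style cross attention.

    text_emb:     (C, d) embeddings of the template-prompt language
                  descriptions, one per class (queries).
    class_tokens: (C, d) visual class token embeddings (keys/values).
    Returns a (C, d) class-aware visual prompt: each row is a
    class-description-weighted combination of the visual class tokens.
    """
    d = text_emb.shape[-1]
    # Scaled dot-product attention weights between text queries
    # and visual keys; rows sum to 1.
    attn = softmax(text_emb @ class_tokens.T / np.sqrt(d))
    # Aggregate the visual class tokens into the prompt.
    return attn @ class_tokens
```

In a real model the prompt produced this way would be concatenated with the image-encoder input sequence, so gradients from the task loss shape both modalities rather than the text side alone.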