即时提款的即时拉动梯度 (Prompt-aligned Gradient for Prompt Tuning) - 专知论文

会员服务 ·

0

Prompt · tuning · Extensibility · 相似度 · 类别 ·

2022 年 5 月 30 日

Prompt-aligned Gradient for Prompt Tuning

翻译：即时提款的即时拉动梯度

Beier Zhu,Yulei Niu,Yucheng Han,Yue Wu,Hanwang Zhang

Thanks to the large pre-trained vision-language models (VLMs) like CLIP, we can craft a zero-shot classifier by "prompt", e.g., the confidence score of an image being "[CLASS]" can be obtained by using the VLM provided similarity measure between the image and the prompt sentence "a photo of a [CLASS]". Therefore, prompt shows a great potential for fast adaptation of VLMs to downstream tasks if we fine-tune the prompt-based similarity measure. However, we find a common failure that improper fine-tuning may not only undermine the prompt's inherent prediction for the task-related classes, but also for other classes in the VLM vocabulary. Existing methods still address this problem by using traditional anti-overfitting techniques such as early stopping and data augmentation, which lack a principled solution specific to prompt. We present Prompt-aligned Gradient, dubbed ProGrad, to prevent prompt tuning from forgetting the the general knowledge learned from VLMs. In particular, ProGrad only updates the prompt whose gradient is aligned (or non-conflicting) to the "general direction", which is represented as the gradient of the KL loss of the pre-defined prompt prediction. Extensive experiments demonstrate the stronger few-shot generalization ability of ProGrad over state-of-the-art prompt tuning methods. Codes are available at https://github.com/BeierZhu/Prompt-align.

翻译：由于CLIP等经过事先训练的大型视觉语言模型(VLM),我们可以用“快速”来制作一个零发分解器,例如,使用VLM提供的图像与“CLAS”相近的快速句子之间的类似度量,就可以获得“[CLAS]照片”的置信分。因此,快速显示,如果我们微调基于迅速的类似度度量,那么VLMS的快速适应下游任务就有很大潜力。然而,我们发现一个常见的失败,即不适当的微调不仅会破坏任务相关等级的及时内在预测,而且会破坏VLM词汇中其他等级的“[CLASS]”图像的置信分。现有的方法仍然能够通过使用传统的反适应技术来解决这个问题,例如早期停止和数据扩充,而这种技术缺乏一个具体针对迅速的有原则性的解决办法。我们介绍“快速调整”的普罗格拉德,以便防止迅速调整从忘记从VLMS中学获得的一般知识。特别是,ProGradd只更新其梯度(或非冲突性)与任务相关等级/高级预测测度能力,这代表了Glas-Graftal-laftal-lab-lab-laftal-laudal-laudal-lab-lab-lauddal-lab-lab-lab-lab-lab-lab-lab-lab-lab-ladal-lad-lad-ladal-ladal-lad-lad-lad-lad-lad-lad-lad-lad-lad-lad-lad-lad-lad-lad-lad-lad-lad-lad-lad-lad-lad-lad-lad-lad-lad-lad-lad-lad-lad-lad-lad-ladal-ladal-lad-lad-lad-lad-lad-lad-lad-lad-lad-lad-lad-lad-lad-lad-lad-lad-lad-lad-lad-lad-lad-lad-lad-lad-lad-

2

相关内容

Prompt

【CVPR 2022】视觉提示调整（VPT），Vision Prompt Tuning

【CVPR 2022】视觉提示调整（VPT），Vision Prompt Tuning

专知会员服务

32+阅读 · 2022年3月12日

【浙大-WWW2022】OntoPrompt & KnowPrompt：知识提示的预训练微调

【浙大-WWW2022】OntoPrompt & KnowPrompt：知识提示的预训练微调

专知会员服务

48+阅读 · 2022年1月26日

零样本文本分类，Zero-Shot Learning for Text Classification

零样本文本分类，Zero-Shot Learning for Text Classification

专知会员服务

97+阅读 · 2020年5月31日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

可解释的CNN

可解释的CNN

CreateAMind

17+阅读 · 2017年10月5日

岩石裂隙渗透性尺寸效应的试验及机理研究

国家自然科学基金

0+阅读 · 2015年12月31日

长链非编码RNA uc002bbp.2在 NSCLC顺铂耐药中的机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

纳米双金属复合氧化物催化臭氧的效能及机理研究

国家自然科学基金

0+阅读 · 2015年12月31日

橡胶果破碎过程及壳仁低损伤分离机理研究

国家自然科学基金

0+阅读 · 2014年12月31日

基于Cosserat连续体平均场理论的颗粒材料多尺度计算均匀化

国家自然科学基金

0+阅读 · 2012年12月31日

基于公平性的城市道路交通网络设计模型与算法研究

国家自然科学基金

0+阅读 · 2012年12月31日

适应反应基因ATF3调控细胞骨架重构抑制膀胱癌转移的作用机制

国家自然科学基金

0+阅读 · 2012年12月31日

基于环境激励的钢筋混凝土立筒群仓动力相互作用机理研究

国家自然科学基金

0+阅读 · 2011年12月31日

牛Nanog基因启动子区负调控元件功能的研究

国家自然科学基金

0+阅读 · 2011年12月31日

MiR-221/222及其靶基因ADAMs调控血管生成分子机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

STT: Soft Template Tuning for Few-Shot Adaptation

Arxiv

0+阅读 · 2022年7月18日

Zero-Shot Temporal Action Detection via Vision-Language Prompting

Arxiv

0+阅读 · 2022年7月17日

Prompt Injection: Parameterization of Fixed Inputs

Arxiv

1+阅读 · 2022年7月15日

Dynamic Low-Resolution Distillation for Cost-Efficient End-to-End Text Spotting

Arxiv

0+阅读 · 2022年7月15日

Parameter-Efficient Prompt Tuning Makes Generalized and Calibrated Neural Text Retrievers

Parameter-Efficient Prompt Tuning Makes Generalized and Calibrated Neural Text Retrievers

Arxiv

0+阅读 · 2022年7月14日

Prompt Distribution Learning

Arxiv

14+阅读 · 2022年5月6日

Balanced Multimodal Learning via On-the-fly Gradient Modulation

Arxiv

13+阅读 · 2022年3月29日

Making Pre-trained Language Models Better Few-shot Learners

Arxiv

14+阅读 · 2020年12月31日

Learning from Few Samples: A Survey

Learning from Few Samples: A Survey

Arxiv

77+阅读 · 2020年7月30日

Contrastive learning of global and local features for medical image segmentation with limited annotations

Arxiv

19+阅读 · 2020年6月18日

VIP会员

文章信息

相关主题

相关VIP内容

【CVPR 2022】视觉提示调整（VPT），Vision Prompt Tuning

【CVPR 2022】视觉提示调整（VPT），Vision Prompt Tuning

专知会员服务

32+阅读 · 2022年3月12日

【浙大-WWW2022】OntoPrompt & KnowPrompt：知识提示的预训练微调

【浙大-WWW2022】OntoPrompt & KnowPrompt：知识提示的预训练微调

专知会员服务

48+阅读 · 2022年1月26日

零样本文本分类，Zero-Shot Learning for Text Classification

零样本文本分类，Zero-Shot Learning for Text Classification

专知会员服务

97+阅读 · 2020年5月31日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《战区安全决策课程体系》最新244页

《"无人机航母"原型平台》

任务规划与地形分析：现代复杂环境作战导航体系

《攻击场景描述形式化模型研究》

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

可解释的CNN

可解释的CNN

CreateAMind

17+阅读 · 2017年10月5日

相关论文

STT: Soft Template Tuning for Few-Shot Adaptation

Arxiv

0+阅读 · 2022年7月18日

Zero-Shot Temporal Action Detection via Vision-Language Prompting

Arxiv

0+阅读 · 2022年7月17日

Prompt Injection: Parameterization of Fixed Inputs

Arxiv

1+阅读 · 2022年7月15日

Dynamic Low-Resolution Distillation for Cost-Efficient End-to-End Text Spotting

Arxiv

0+阅读 · 2022年7月15日

Parameter-Efficient Prompt Tuning Makes Generalized and Calibrated Neural Text Retrievers

Parameter-Efficient Prompt Tuning Makes Generalized and Calibrated Neural Text Retrievers

Arxiv

0+阅读 · 2022年7月14日

Prompt Distribution Learning

Arxiv

14+阅读 · 2022年5月6日

Balanced Multimodal Learning via On-the-fly Gradient Modulation

Arxiv

13+阅读 · 2022年3月29日

Making Pre-trained Language Models Better Few-shot Learners

Arxiv

14+阅读 · 2020年12月31日

Learning from Few Samples: A Survey

Learning from Few Samples: A Survey

Arxiv

77+阅读 · 2020年7月30日

Contrastive learning of global and local features for medical image segmentation with limited annotations

Arxiv

19+阅读 · 2020年6月18日

相关基金

岩石裂隙渗透性尺寸效应的试验及机理研究

国家自然科学基金

0+阅读 · 2015年12月31日

长链非编码RNA uc002bbp.2在 NSCLC顺铂耐药中的机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

纳米双金属复合氧化物催化臭氧的效能及机理研究

国家自然科学基金

0+阅读 · 2015年12月31日

橡胶果破碎过程及壳仁低损伤分离机理研究

国家自然科学基金

0+阅读 · 2014年12月31日

基于Cosserat连续体平均场理论的颗粒材料多尺度计算均匀化

国家自然科学基金

0+阅读 · 2012年12月31日

基于公平性的城市道路交通网络设计模型与算法研究

国家自然科学基金

0+阅读 · 2012年12月31日

适应反应基因ATF3调控细胞骨架重构抑制膀胱癌转移的作用机制

国家自然科学基金

0+阅读 · 2012年12月31日

基于环境激励的钢筋混凝土立筒群仓动力相互作用机理研究

国家自然科学基金

0+阅读 · 2011年12月31日

牛Nanog基因启动子区负调控元件功能的研究

国家自然科学基金

0+阅读 · 2011年12月31日

MiR-221/222及其靶基因ADAMs调控血管生成分子机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员