Pro-tuning: Unified Prompt Tuning for Vision Tasks - Zhuanzhi (专知) Papers


July 28, 2022

Pro-tuning: Unified Prompt Tuning for Vision Tasks


Xing Nie, Bolin Ni, Jianlong Chang, Gaomeng Meng, Chunlei Huo, Zhaoxiang Zhang, Shiming Xiang, Qi Tian, Chunhong Pan

In computer vision, fine-tuning is the de facto approach to leveraging pre-trained vision models for downstream tasks. However, deploying it in practice is challenging because it relies on a parameter-inefficient global update and depends heavily on high-quality downstream data. Recently, prompt-based learning, which adds a task-relevant prompt to adapt downstream tasks to pre-trained models, has drastically boosted performance on many natural language downstream tasks. In this work, we extend the transfer ability gained from prompts to vision models as an alternative to fine-tuning. To this end, we propose parameter-efficient Prompt tuning (Pro-tuning) to adapt frozen vision models to various downstream vision tasks. The key to Pro-tuning is prompt-based tuning, i.e., learning task-specific vision prompts for downstream input images while keeping the pre-trained model frozen. By training only a few additional parameters, it can work on diverse CNN-based and Transformer-based architectures. Extensive experiments show that Pro-tuning outperforms fine-tuning across a broad range of vision tasks and scenarios, including image classification (generic objects, class imbalance, image corruption, adversarial robustness, and out-of-distribution generalization) and dense prediction tasks such as object detection and semantic segmentation.
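The general idea of learning a prompt while the pre-trained model stays frozen can be illustrated with a toy sketch. This is not the authors' Pro-tuning architecture, which the abstract does not specify. It assumes a hypothetical frozen linear "backbone" `W`, a learnable additive input prompt, a squared-error loss, and made-up data, all chosen only for illustration.

```python
import copy

# Hypothetical frozen "pre-trained" weights (2 outputs, 3 inputs).
# Nothing below ever writes to W.
W = [[0.5, -0.2, 0.1],
     [0.3, 0.8, -0.5]]

def backbone(x):
    """Apply the frozen linear map W to a 3-dimensional input."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]

def loss_and_grad(prompt, data):
    """Mean squared error of backbone(x + prompt) and its gradient w.r.t. the prompt."""
    n = len(data)
    loss = 0.0
    grad = [0.0] * len(prompt)
    for x, y in data:
        out = backbone([xi + pi for xi, pi in zip(x, prompt)])
        err = [o - t for o, t in zip(out, y)]
        loss += sum(e * e for e in err) / n
        # Gradient of the squared error w.r.t. the prompt is 2 * W^T err.
        for j in range(len(prompt)):
            grad[j] += 2.0 * sum(W[i][j] * err[i] for i in range(len(W))) / n
    return loss, grad

def tune_prompt(data, steps=200, lr=0.1):
    """Plain gradient descent on the prompt only; the backbone is untouched."""
    prompt = [0.0, 0.0, 0.0]
    for _ in range(steps):
        _, grad = loss_and_grad(prompt, data)
        prompt = [p - lr * g for p, g in zip(prompt, grad)]
    return prompt

# Made-up downstream data: (input, target) pairs.
data = [([1.0, 0.0, 0.0], [1.0, 0.0]),
        ([0.0, 1.0, 0.0], [0.3, 0.5]),
        ([0.0, 0.0, 1.0], [0.6, -0.8])]

frozen_copy = copy.deepcopy(W)
loss_before, _ = loss_and_grad([0.0, 0.0, 0.0], data)
prompt = tune_prompt(data)
loss_after, _ = loss_and_grad(prompt, data)

print("trainable parameters:", len(prompt))
print("frozen parameters:", sum(len(row) for row in W))
print("loss before / after:", loss_before, loss_after)
```

In this sketch only the three prompt values are updated, while the six backbone weights stay exactly as they were. A real vision model would be far larger, so the trainable fraction would be correspondingly smaller.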

