Although significant progress has been made in few-shot learning, most existing few-shot image classification methods require supervised pre-training on a large number of samples from base classes, which limits their generalization ability in real-world applications. Recently, large-scale Vision-Language Pre-trained models (VLPs) have been gaining increasing attention in few-shot learning because they can provide a new paradigm for transferable visual representation learning using text that is easily available on the Web. However, VLPs may neglect fine-grained visual information that is difficult to describe in language but is important for learning an effective classifier that distinguishes different images. To address this problem, we propose a new framework, named Semantic-guided Visual Adapting (SgVA), which effectively extends vision-language pre-trained models to produce discriminative adapted visual features by jointly using implicit knowledge distillation, a vision-specific contrastive loss, and a cross-modal contrastive loss. The implicit knowledge distillation is designed to transfer fine-grained cross-modal knowledge that guides the updating of the vision adapter. State-of-the-art results on 13 datasets demonstrate that the adapted visual features can well complement the cross-modal features to improve few-shot image classification.
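To make the loss design concrete, the sketch below illustrates one plausible way to combine the three ingredients named above (a cross-modal contrastive loss, a vision-specific contrastive loss, and distillation from the frozen image-text logits) on top of frozen CLIP-style features. This is a minimal sketch, not the authors' implementation: the adapter architecture, the loss weights alpha and beta, and the temperature tau are assumptions introduced only for illustration.

```python
# Illustrative sketch of an SgVA-style objective (hypothetical names and hyperparameters).
import torch
import torch.nn as nn
import torch.nn.functional as F


class VisionAdapter(nn.Module):
    """Hypothetical residual MLP adapter on top of frozen visual features."""

    def __init__(self, dim: int, hidden: int = 512):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(dim, hidden), nn.ReLU(), nn.Linear(hidden, dim)
        )

    def forward(self, v):
        # Residual adaptation followed by L2 normalization.
        return F.normalize(v + self.net(v), dim=-1)


def sgva_style_losses(v_feat, t_feat, labels, adapter, tau=0.07, alpha=1.0, beta=1.0):
    """Weighted sum of cross-modal contrastive, vision-specific contrastive,
    and distillation losses. v_feat: (B, D) frozen image features,
    t_feat: (C, D) frozen class text features, labels: (B,) class indices."""
    v_adapt = adapter(v_feat)                   # (B, D) adapted visual features
    v_froz = F.normalize(v_feat, dim=-1)
    t_feat = F.normalize(t_feat, dim=-1)

    # 1) Cross-modal contrastive loss: match adapted images to class text prototypes.
    logits_adapt = v_adapt @ t_feat.t() / tau
    loss_cross = F.cross_entropy(logits_adapt, labels)

    # 2) Vision-specific (supervised) contrastive loss among adapted image features.
    sim = v_adapt @ v_adapt.t() / tau
    eye = torch.eye(sim.size(0), dtype=torch.bool, device=sim.device)
    sim = sim.masked_fill(eye, -1e9)            # exclude self-similarity
    pos_mask = labels.unsqueeze(0).eq(labels.unsqueeze(1)) & ~eye
    log_prob = sim - torch.logsumexp(sim, dim=1, keepdim=True)
    loss_vision = (-(log_prob * pos_mask).sum(1) / pos_mask.sum(1).clamp(min=1)).mean()

    # 3) Implicit knowledge distillation: frozen image-text logits guide the adapter.
    with torch.no_grad():
        logits_frozen = v_froz @ t_feat.t() / tau
    loss_distill = F.kl_div(
        F.log_softmax(logits_adapt, dim=1),
        F.softmax(logits_frozen, dim=1),
        reduction="batchmean",
    )

    return loss_cross + alpha * loss_vision + beta * loss_distill
```

In this reading, only the adapter is trained: the cross-modal term pulls adapted images toward their class text embeddings, the vision-specific term separates images of different classes directly in the visual space, and the distillation term keeps the adapted predictions consistent with the frozen image-text similarities so that pre-trained knowledge is not forgotten.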