MaskCLIP: 蒙面的自我学习进步 (MaskCLIP: Masked Self-Distillation Advances Contrastive Language-Image Pretraining) - 专知论文

会员服务 ·

0

contrastive · 掩码 · Learning · Analysis · Guidance ·

2022 年 8 月 25 日

MaskCLIP: Masked Self-Distillation Advances Contrastive Language-Image Pretraining

翻译：MaskCLIP: 蒙面的自我学习进步

Xiaoyi Dong,Yinglin Zheng,Jianmin Bao,Ting Zhang,Dongdong Chen,Hao Yang,Ming Zeng,Weiming Zhang,Lu Yuan,Dong Chen,Fang Wen,Nenghai Yu

This paper presents a simple yet effective framework MaskCLIP, which incorporates a newly proposed masked self-distillation into contrastive language-image pretraining. The core idea of masked self-distillation is to distill representation from a full image to the representation predicted from a masked image. Such incorporation enjoys two vital benefits. First, masked self-distillation targets local patch representation learning, which is complementary to vision-language contrastive focusing on text-related representation.Second, masked self-distillation is also consistent with vision-language contrastive from the perspective of training objective as both utilize the visual encoder for feature aligning, and thus is able to learn local semantics getting indirect supervision from the language. We provide specially designed experiments with a comprehensive analysis to validate the two benefits. Empirically, we show that MaskCLIP, when applied to various challenging downstream tasks, achieves superior results in linear probing, finetuning as well as the zero-shot performance with the guidance of the language encoder.

翻译：本文提出了一个简单而有效的框架 MaskCLIP, 将新提出的蒙面自我蒸馏纳入对比性语言图像培训前阶段。蒙面自我蒸馏的核心理念是将图像从完整的图像蒸馏到蒙面图像预测的表达方式。这种整合具有两个重要好处。首先, 蒙面自我蒸馏目标是局部补丁代表制学习, 这与注重文本代表方式的视觉语言对比性学习是相辅相成的。其次, 蒙面自我蒸馏也与从培训目标角度的愿景语言对比一致,因为培训目标既利用视觉编码器进行功能组合,也能够学习本地语义学,从语言中间接监督。我们提供专门设计的实验,通过全面分析来验证两种好处。我们随机地表明, MaskCLIP 在应用于各种具有挑战性的下游任务时,在线性勘测、微调以及根据语言编码器的指导零发性表现方面,取得了优异的结果。

0

相关内容

contrastive

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

NeurIPS 2021教程|OpenAI-Lilian Weng等：自监督学习与对比学习，105页ppt，

NeurIPS 2021教程|OpenAI-Lilian Weng等：自监督学习与对比学习，105页ppt，

专知会员服务

78+阅读 · 2021年12月10日

最新《自监督表示学习》报告，70页ppt

最新《自监督表示学习》报告，70页ppt

专知会员服务

86+阅读 · 2020年12月22日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

165+阅读 · 2020年3月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知

133+阅读 · 2020年3月18日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

碳源胁迫对颗粒污泥稳定性及除磷特性的影响及机制

国家自然科学基金

0+阅读 · 2013年12月31日

闪电影响对流层上部NOx增强的遥感分析和模式研究

国家自然科学基金

0+阅读 · 2013年12月31日

miRNA调控细胞自噬及其在PRRSV增殖中作用与机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

Importin α3蛋白在晶状体上皮细胞衰老中的作用及其表观遗传学调控机制的研究

国家自然科学基金

0+阅读 · 2012年12月31日

组蛋白去甲基化酶UTX参与哺乳动物衰老及其基因调控网络研究

国家自然科学基金

0+阅读 · 2012年12月31日

ANCA诱导的ROS在调控中性粒细胞凋亡∕NETosis转换中的作用机制

国家自然科学基金

0+阅读 · 2012年12月31日

硫化氢对再灌注性心肌细胞自噬的影响及其信号转导机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

收益管理中的排序理论及算法研究

国家自然科学基金

0+阅读 · 2011年12月31日

稀土RE-Fe-Cr三元系相图及其化合物吸波性能研究

国家自然科学基金

0+阅读 · 2009年12月31日

ClC-3氯通道蛋白在肿瘤转移中的功能研究

国家自然科学基金

0+阅读 · 2008年12月31日

Real-World Robot Learning with Masked Visual Pre-training

Arxiv

0+阅读 · 2022年10月6日

SemMAE: Semantic-Guided Masking for Learning Masked Autoencoders

Arxiv

0+阅读 · 2022年10月5日

Efficient Vision-Language Pretraining with Visual Concepts and Hierarchical Alignment

Arxiv

0+阅读 · 2022年10月5日

Masked Supervised Learning for Semantic Segmentation

Arxiv

0+阅读 · 2022年10月3日

Medical Image Understanding with Pretrained Vision Language Models: A Comprehensive Study

Arxiv

0+阅读 · 2022年9月30日

SmallCap: Lightweight Image Captioning Prompted with Retrieval Augmentation

Arxiv

0+阅读 · 2022年9月30日

Dense Contrastive Learning for Self-Supervised Visual Pre-Training

Arxiv

18+阅读 · 2021年4月4日

Contrastive learning of global and local features for medical image segmentation with limited annotations

Arxiv

19+阅读 · 2020年6月18日

UniLMv2: Pseudo-Masked Language Models for Unified Language Model Pre-Training

Arxiv

15+阅读 · 2020年2月28日

Compositional GAN: Learning Conditional Image Composition

Compositional GAN: Learning Conditional Image Composition

Arxiv

31+阅读 · 2018年7月19日

VIP会员

文章信息

相关主题

相关VIP内容

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

NeurIPS 2021教程|OpenAI-Lilian Weng等：自监督学习与对比学习，105页ppt，

NeurIPS 2021教程|OpenAI-Lilian Weng等：自监督学习与对比学习，105页ppt，

专知会员服务

78+阅读 · 2021年12月10日

最新《自监督表示学习》报告，70页ppt

最新《自监督表示学习》报告，70页ppt

专知会员服务

86+阅读 · 2020年12月22日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

165+阅读 · 2020年3月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《生成式人工智能与大/小语言模型在供应链管理决策优化与可持续性提升中的作用评估》最新51页

白宫发布《赢得AI竞赛：美国人工智能行动计划》最新28页

地下战：地下空间的战略博弈

《美地下作战条令手册》228页

相关资讯

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知

133+阅读 · 2020年3月18日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

相关论文

Real-World Robot Learning with Masked Visual Pre-training

Arxiv

0+阅读 · 2022年10月6日

SemMAE: Semantic-Guided Masking for Learning Masked Autoencoders

Arxiv

0+阅读 · 2022年10月5日

Efficient Vision-Language Pretraining with Visual Concepts and Hierarchical Alignment

Arxiv

0+阅读 · 2022年10月5日

Masked Supervised Learning for Semantic Segmentation

Arxiv

0+阅读 · 2022年10月3日

Medical Image Understanding with Pretrained Vision Language Models: A Comprehensive Study

Arxiv

0+阅读 · 2022年9月30日

SmallCap: Lightweight Image Captioning Prompted with Retrieval Augmentation

Arxiv

0+阅读 · 2022年9月30日

Dense Contrastive Learning for Self-Supervised Visual Pre-Training

Arxiv

18+阅读 · 2021年4月4日

Contrastive learning of global and local features for medical image segmentation with limited annotations

Arxiv

19+阅读 · 2020年6月18日

UniLMv2: Pseudo-Masked Language Models for Unified Language Model Pre-Training

Arxiv

15+阅读 · 2020年2月28日

Compositional GAN: Learning Conditional Image Composition

Compositional GAN: Learning Conditional Image Composition

Arxiv

31+阅读 · 2018年7月19日

相关基金

碳源胁迫对颗粒污泥稳定性及除磷特性的影响及机制

国家自然科学基金

0+阅读 · 2013年12月31日

闪电影响对流层上部NOx增强的遥感分析和模式研究

国家自然科学基金

0+阅读 · 2013年12月31日

miRNA调控细胞自噬及其在PRRSV增殖中作用与机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

Importin α3蛋白在晶状体上皮细胞衰老中的作用及其表观遗传学调控机制的研究

国家自然科学基金

0+阅读 · 2012年12月31日

组蛋白去甲基化酶UTX参与哺乳动物衰老及其基因调控网络研究

国家自然科学基金

0+阅读 · 2012年12月31日

ANCA诱导的ROS在调控中性粒细胞凋亡∕NETosis转换中的作用机制

国家自然科学基金

0+阅读 · 2012年12月31日

硫化氢对再灌注性心肌细胞自噬的影响及其信号转导机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

收益管理中的排序理论及算法研究

国家自然科学基金

0+阅读 · 2011年12月31日

稀土RE-Fe-Cr三元系相图及其化合物吸波性能研究

国家自然科学基金

0+阅读 · 2009年12月31日

ClC-3氯通道蛋白在肿瘤转移中的功能研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员