This paper investigates two techniques for developing efficient self-supervised vision transformers (EsViT) for visual representation learning. First, we show through a comprehensive empirical study that multi-stage architectures with sparse self-attention can significantly reduce modeling complexity, but at the cost of losing the ability to capture fine-grained correspondences between image regions. Second, we propose a new pre-training task of region matching, which allows the model to capture fine-grained region dependencies and, as a result, significantly improves the quality of the learned vision representations. Our results show that, combining the two techniques, EsViT achieves 81.3% top-1 accuracy on the ImageNet linear probe evaluation, outperforming prior art with around an order of magnitude higher throughput. When transferring to downstream linear classification tasks, EsViT outperforms its supervised counterpart on 17 out of 18 datasets. The code and models are publicly available: https://github.com/microsoft/esvit