The paper presents a scalable approach for simultaneously learning distributed representations over individual tokens and a holistic instance representation. We use self-attention blocks to represent distributed tokens, followed by cross-attention blocks to aggregate the holistic instance. The core of the approach is the use of extremely large token masking (75%-90%) as data augmentation for supervision. Our model, named ExtreMA, follows the plain BYOL approach, where the instance representation from the unmasked subset is trained to predict that from the intact input. Instead of encouraging invariance across views, learning requires the model to capture informative variations within an instance. The paper makes three contributions: 1) Random masking is a strong and computationally efficient data augmentation for learning generalizable attention representations. 2) With multiple sampling per instance, extreme masking greatly speeds up learning and hungers for more data. 3) Distributed representations can be learned from instance supervision alone, unlike the per-token supervision in masked modeling.
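To make the described pipeline concrete, below is a minimal PyTorch sketch of one training step under the stated setup: a self-attention token encoder, a cross-attention pooling module for the holistic instance, extreme random masking of the online branch, and a BYOL-style target branch on the intact input. The class names (TokenEncoder, CrossAttnPool, ExtreMASketch), the random_mask helper, and all hyperparameters are illustrative assumptions, not the paper's reference implementation.

```python
# Minimal sketch, assuming a ViT-style patch tokenization upstream.
import copy
import torch
import torch.nn as nn
import torch.nn.functional as F


class TokenEncoder(nn.Module):
    """Self-attention blocks producing distributed token representations."""
    def __init__(self, dim=256, depth=4, heads=8):
        super().__init__()
        layer = nn.TransformerEncoderLayer(dim, heads, dim * 4, batch_first=True)
        self.blocks = nn.TransformerEncoder(layer, depth)

    def forward(self, tokens):          # (B, N, dim) -> (B, N, dim)
        return self.blocks(tokens)


class CrossAttnPool(nn.Module):
    """Cross-attention that aggregates tokens into one holistic instance vector."""
    def __init__(self, dim=256, heads=8):
        super().__init__()
        self.query = nn.Parameter(torch.randn(1, 1, dim))
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)

    def forward(self, tokens):          # (B, N, dim) -> (B, dim)
        q = self.query.expand(tokens.size(0), -1, -1)
        out, _ = self.attn(q, tokens, tokens)
        return out.squeeze(1)


def random_mask(tokens, keep_ratio):
    """Keep a small random subset of tokens (75%-90% masking keeps 10%-25%)."""
    B, N, _ = tokens.shape
    n_keep = max(1, int(N * keep_ratio))
    idx = torch.rand(B, N, device=tokens.device).argsort(dim=1)[:, :n_keep]
    return tokens.gather(1, idx.unsqueeze(-1).expand(-1, -1, tokens.size(-1)))


class ExtreMASketch(nn.Module):
    def __init__(self, dim=256):
        super().__init__()
        self.online_enc, self.online_pool = TokenEncoder(dim), CrossAttnPool(dim)
        self.predictor = nn.Sequential(nn.Linear(dim, dim), nn.GELU(), nn.Linear(dim, dim))
        # Target branch: an EMA copy of the online branch, not updated by gradients.
        self.target_enc = copy.deepcopy(self.online_enc).requires_grad_(False)
        self.target_pool = copy.deepcopy(self.online_pool).requires_grad_(False)

    def forward(self, tokens, keep_ratio=0.15):
        # Online branch sees only the small unmasked subset of tokens.
        visible = random_mask(tokens, keep_ratio)
        online = self.predictor(self.online_pool(self.online_enc(visible)))
        # Target branch sees the intact input.
        with torch.no_grad():
            target = self.target_pool(self.target_enc(tokens))
        # BYOL-style loss: predict the intact-input instance representation.
        return 2 - 2 * F.cosine_similarity(online, target, dim=-1).mean()

    @torch.no_grad()
    def update_target(self, momentum=0.996):
        for o, t in zip(self.online_enc.parameters(), self.target_enc.parameters()):
            t.mul_(momentum).add_(o, alpha=1 - momentum)
        for o, t in zip(self.online_pool.parameters(), self.target_pool.parameters()):
            t.mul_(momentum).add_(o, alpha=1 - momentum)


# Usage: patch tokens from an image batch, two masked samples per instance.
tokens = torch.randn(8, 196, 256)                  # (batch, num_patches, dim)
model = ExtreMASketch()
loss = sum(model(tokens) for _ in range(2)) / 2    # multiple sampling per instance
loss.backward()
model.update_target()
```

Averaging losses over several independently masked views of the same instance is what "multiple sampling per instance" refers to above; each view is cheap because the online encoder processes only the visible 10%-25% of tokens.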