SdAE: 自我蒸馏的蒙面自动编码器 (SdAE: Self-distillated Masked Autoencoder) - 专知论文

会员服务 ·

0

掩码 · INFORMS · Branch · BEiT · 自编码器 ·

2022 年 7 月 31 日

SdAE: Self-distillated Masked Autoencoder

翻译：SdAE: 自我蒸馏的蒙面自动编码器

Yabo Chen,Yuchen Liu,Dongsheng Jiang,Xiaopeng Zhang,Wenrui Dai,Hongkai Xiong,Qi Tian

from arxiv, Accepted to ECCV 2022

With the development of generative-based self-supervised learning (SSL) approaches like BeiT and MAE, how to learn good representations by masking random patches of the input image and reconstructing the missing information has grown in concern. However, BeiT and PeCo need a "pre-pretraining" stage to produce discrete codebooks for masked patches representing. MAE does not require a pre-training codebook process, but setting pixels as reconstruction targets may introduce an optimization gap between pre-training and downstream tasks that good reconstruction quality may not always lead to the high descriptive capability for the model. Considering the above issues, in this paper, we propose a simple Self-distillated masked AutoEncoder network, namely SdAE. SdAE consists of a student branch using an encoder-decoder structure to reconstruct the missing information, and a teacher branch producing latent representation of masked tokens. We also analyze how to build good views for the teacher branch to produce latent representation from the perspective of information bottleneck. After that, we propose a multi-fold masking strategy to provide multiple masked views with balanced information for boosting the performance, which can also reduce the computational complexity. Our approach generalizes well: with only 300 epochs pre-training, a vanilla ViT-Base model achieves an 84.1% fine-tuning accuracy on ImageNet-1k classification, 48.6 mIOU on ADE20K segmentation, and 48.9 mAP on COCO detection, which surpasses other methods by a considerable margin. Code is available at https://github.com/AbrahamYabo/SdAE.

翻译：随着基于基因的自我监督学习(SSL)方法的发展,如BeiT和MAE,如何通过掩盖输入图像的随机补丁以及重建缺失的信息来学习良好的表现,这引起了人们的关注。然而,BeiT和Peco需要一个“预先培训”阶段,以便为隐蔽的补丁制作独立的代码手册。MAE不需要一个培训前的代码手册程序,但设置像素,因为重建目标可能会在Vi20前和下游任务之间造成最优化的差距,而良好的重建质量并不总是导致模型的高描述能力。考虑到上述问题,我们在本文中建议建立一个简单的自我提炼的隐蔽自动编码网络网络,即SdAE。SdAE包括一个使用编码解码结构来重建隐蔽信息的学生分支,而教师分支则产生隐蔽的代码。我们还分析了如何为教师分支建立良好的观点,以便从信息瓶颈的角度产生可实现的潜值代表。随后,我们提议了一个多倍的缩略图战略,在300A-RODA上提供多重的缩略图。

0

相关内容

对比学习简述

专知会员服务

90+阅读 · 2021年6月29日

不可错过！MILA最新《自监督表示学习》课程，附PPT与视频下载

不可错过！MILA最新《自监督表示学习》课程，附PPT与视频下载

专知会员服务

90+阅读 · 2020年12月21日

【ICML2020】统一预训练伪掩码语言模型

【ICML2020】统一预训练伪掩码语言模型

专知会员服务

27+阅读 · 2020年7月23日

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

专知会员服务

115+阅读 · 2020年4月5日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

抢鲜看！13篇CVPR2020论文链接/开源代码/解读

抢鲜看！13篇CVPR2020论文链接/开源代码/解读

专知会员服务

50+阅读 · 2020年2月26日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

【论文推荐】最新5篇图像分割（Image Segmentation）相关论文—多重假设、超像素分割、自监督、图、生成对抗网络

【论文推荐】最新5篇图像分割（Image Segmentation）相关论文—多重假设、超像素分割、自监督、图、生成对抗网络

专知

27+阅读 · 2018年2月7日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

MoCoGAN 分解运动和内容的视频生成

MoCoGAN 分解运动和内容的视频生成

CreateAMind

18+阅读 · 2017年10月21日

【推荐】全卷积语义分割综述

【推荐】全卷积语义分割综述

机器学习研究会

19+阅读 · 2017年8月31日

基于非对称扩展的可逆水印研究

国家自然科学基金

0+阅读 · 2015年12月31日

天然截短型蛋白EsDREB2B的抗旱分子机理研究

国家自然科学基金

0+阅读 · 2015年12月31日

超高交联聚苯胺的合成、结构及其对重金属离子和溶解性有机物的共吸附机理

国家自然科学基金

0+阅读 · 2015年12月31日

随机泛函微分方程的动力学性态

国家自然科学基金

0+阅读 · 2012年12月31日

基于群智的开放式数据集成与分析技术研究

国家自然科学基金

2+阅读 · 2012年12月31日

变分与拓扑方法对若干重要椭圆方程的应用研究

国家自然科学基金

0+阅读 · 2012年12月31日

微分方程的分支理论

国家自然科学基金

0+阅读 · 2012年12月31日

一类随机偏微分方程解的存在唯一性和渐近性质

国家自然科学基金

0+阅读 · 2012年12月31日

算子概率论中的算子论和算子代数问题

国家自然科学基金

0+阅读 · 2011年12月31日

可压Navier-Stokes方程及相关流体动力学方程研究

国家自然科学基金

0+阅读 · 2008年12月31日

Weighted Contrastive Hashing

Arxiv

0+阅读 · 2022年9月28日

Reconstruction-guided attention improves the robustness and shape processing of neural networks

Arxiv

0+阅读 · 2022年9月27日

Generalized Parametric Contrastive Learning

Arxiv

0+阅读 · 2022年9月26日

Multimodal Learning with Channel-Mixing and Masked Autoencoder on Facial Action Unit Detection

Arxiv

0+阅读 · 2022年9月25日

Self-Supervised Masked Convolutional Transformer Block for Anomaly Detection

Arxiv

1+阅读 · 2022年9月25日

CLIP-ViP: Adapting Pre-trained Image-Text Model to Video-Language Representation Alignment

Arxiv

0+阅读 · 2022年9月23日

CUTS: A Fully Unsupervised Framework for Medical Image Segmentation

Arxiv

0+阅读 · 2022年9月23日

Masked Autoencoders Are Scalable Vision Learners

Arxiv

27+阅读 · 2021年11月11日

Dense Contrastive Learning for Self-Supervised Visual Pre-Training

Arxiv

18+阅读 · 2021年4月4日

End-to-End Dense Video Captioning with Masked Transformer

Arxiv

14+阅读 · 2018年4月3日

VIP会员

文章信息

相关主题

相关VIP内容

对比学习简述

专知会员服务

90+阅读 · 2021年6月29日

不可错过！MILA最新《自监督表示学习》课程，附PPT与视频下载

不可错过！MILA最新《自监督表示学习》课程，附PPT与视频下载

专知会员服务

90+阅读 · 2020年12月21日

【ICML2020】统一预训练伪掩码语言模型

【ICML2020】统一预训练伪掩码语言模型

专知会员服务

27+阅读 · 2020年7月23日

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

专知会员服务

115+阅读 · 2020年4月5日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

抢鲜看！13篇CVPR2020论文链接/开源代码/解读

抢鲜看！13篇CVPR2020论文链接/开源代码/解读

专知会员服务

50+阅读 · 2020年2月26日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《复杂工程系统模型驱动设计决策支持系统：早期设计阶段挑战》最新138页

《日本陆上自卫队2040年作战方式与未来作战研究》最新23页slides

人工智能作为战争武器

《后勤保障》最新23页

相关资讯

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

【论文推荐】最新5篇图像分割（Image Segmentation）相关论文—多重假设、超像素分割、自监督、图、生成对抗网络

【论文推荐】最新5篇图像分割（Image Segmentation）相关论文—多重假设、超像素分割、自监督、图、生成对抗网络

专知

27+阅读 · 2018年2月7日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

MoCoGAN 分解运动和内容的视频生成

MoCoGAN 分解运动和内容的视频生成

CreateAMind

18+阅读 · 2017年10月21日

【推荐】全卷积语义分割综述

【推荐】全卷积语义分割综述

机器学习研究会

19+阅读 · 2017年8月31日

相关论文

Weighted Contrastive Hashing

Arxiv

0+阅读 · 2022年9月28日

Reconstruction-guided attention improves the robustness and shape processing of neural networks

Arxiv

0+阅读 · 2022年9月27日

Generalized Parametric Contrastive Learning

Arxiv

0+阅读 · 2022年9月26日

Multimodal Learning with Channel-Mixing and Masked Autoencoder on Facial Action Unit Detection

Arxiv

0+阅读 · 2022年9月25日

Self-Supervised Masked Convolutional Transformer Block for Anomaly Detection

Arxiv

1+阅读 · 2022年9月25日

CLIP-ViP: Adapting Pre-trained Image-Text Model to Video-Language Representation Alignment

Arxiv

0+阅读 · 2022年9月23日

CUTS: A Fully Unsupervised Framework for Medical Image Segmentation

Arxiv

0+阅读 · 2022年9月23日

Masked Autoencoders Are Scalable Vision Learners

Arxiv

27+阅读 · 2021年11月11日

Dense Contrastive Learning for Self-Supervised Visual Pre-Training

Arxiv

18+阅读 · 2021年4月4日

End-to-End Dense Video Captioning with Masked Transformer

Arxiv

14+阅读 · 2018年4月3日

相关基金

基于非对称扩展的可逆水印研究

国家自然科学基金

0+阅读 · 2015年12月31日

天然截短型蛋白EsDREB2B的抗旱分子机理研究

国家自然科学基金

0+阅读 · 2015年12月31日

超高交联聚苯胺的合成、结构及其对重金属离子和溶解性有机物的共吸附机理

国家自然科学基金

0+阅读 · 2015年12月31日

随机泛函微分方程的动力学性态

国家自然科学基金

0+阅读 · 2012年12月31日

基于群智的开放式数据集成与分析技术研究

国家自然科学基金

2+阅读 · 2012年12月31日

变分与拓扑方法对若干重要椭圆方程的应用研究

国家自然科学基金

0+阅读 · 2012年12月31日

微分方程的分支理论

国家自然科学基金

0+阅读 · 2012年12月31日

一类随机偏微分方程解的存在唯一性和渐近性质

国家自然科学基金

0+阅读 · 2012年12月31日

算子概率论中的算子论和算子代数问题

国家自然科学基金

0+阅读 · 2011年12月31日

可压Navier-Stokes方程及相关流体动力学方程研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员