DisCo: Remedy Self-supervised Learning on Lightweight Models with Distilled Contrastive Learning - Zhuanzhi (专知) Papers


July 4, 2022

DisCo: Remedy Self-supervised Learning on Lightweight Models with Distilled Contrastive Learning


Yuting Gao, Jia-Xin Zhuang, Shaohui Lin, Hao Cheng, Xing Sun, Ke Li, Chunhua Shen

From arXiv, ECCV 2022

While self-supervised representation learning (SSL) has received widespread attention from the community, recent research argues that its performance suffers a cliff fall as model size decreases. Current methods mainly rely on contrastive learning to train the network, and in this work we propose a simple yet effective method, Distilled Contrastive Learning (DisCo), to ease the issue by a large margin. Specifically, we find that the final embedding obtained by mainstream SSL methods contains the most fruitful information, and we propose to distill the final embedding to maximally transmit a teacher's knowledge to a lightweight model by constraining the last embedding of the student to be consistent with that of the teacher. In addition, in our experiments we find a phenomenon termed Distilling BottleNeck, and we propose enlarging the embedding dimension to alleviate it. Our method does not introduce any extra parameters to lightweight models during deployment. Experimental results demonstrate that our method achieves the state of the art on all lightweight models. In particular, when ResNet-101/ResNet-50 is used as the teacher for EfficientNet-B0, the linear evaluation result of EfficientNet-B0 on ImageNet is very close to that of ResNet-101/ResNet-50, while EfficientNet-B0 has only 9.4%/16.3% of the parameters of ResNet-101/ResNet-50. Code is available at https://github.com/Yuting-Gao/DisCo-pytorch.
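The abstract describes constraining the student's last embedding to be consistent with the teacher's, but does not state the exact loss. The following is a minimal sketch of one plausible form, assuming a mean squared error between L2-normalized embeddings of the same input. The function names `l2_normalize`, `embedding_consistency_loss`, and `batch_consistency_loss` are hypothetical and are not taken from the DisCo repository.

```python
import math
from typing import Sequence


def l2_normalize(vec: Sequence[float]) -> list:
    """Scale a vector to unit Euclidean length."""
    norm = math.sqrt(sum(x * x for x in vec))
    if norm == 0.0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in vec]


def embedding_consistency_loss(student: Sequence[float],
                               teacher: Sequence[float]) -> float:
    """Assumed loss: mean squared error between the normalized student
    embedding and the normalized (frozen) teacher embedding."""
    if len(student) != len(teacher):
        raise ValueError("student and teacher embeddings must share a dimension")
    s = l2_normalize(student)
    t = l2_normalize(teacher)
    return sum((a - b) ** 2 for a, b in zip(s, t)) / len(s)


def batch_consistency_loss(students: Sequence[Sequence[float]],
                           teachers: Sequence[Sequence[float]]) -> float:
    """Average the per-sample consistency loss over a batch."""
    if len(students) != len(teachers) or len(students) == 0:
        raise ValueError("batches must be non-empty and of equal size")
    total = sum(embedding_consistency_loss(s, t)
                for s, t in zip(students, teachers))
    return total / len(students)
```

In this sketch the loss is zero when the student embedding points in the same direction as the teacher embedding and grows as the two diverge. Because both embeddings must share a dimension, a student whose projection head outputs a smaller vector than the teacher's would need its head widened to match, which is only loosely related to the embedding-dimension enlargement the abstract mentions.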

