Ensembling has proven to be a powerful technique for boosting model performance, uncertainty estimation, and robustness in supervised learning. Advances in self-supervised learning (SSL) make it possible to leverage large unlabeled corpora to reach state-of-the-art few-shot and supervised learning performance. In this paper, we explore how ensemble methods can improve recent SSL techniques by developing a framework that permits data-dependent weighted cross-entropy losses. We refrain from ensembling the representation backbone; this choice yields an efficient ensemble method that adds only a small training cost and requires no architectural changes or computational overhead at downstream evaluation. The effectiveness of our method is demonstrated with two state-of-the-art SSL methods, DINO (Caron et al., 2021) and MSN (Assran et al., 2022). Our method outperforms both on multiple evaluation metrics on ImageNet-1K, particularly in the few-shot setting. We explore several weighting schemes and find that those which increase the diversity of the ensemble heads lead to better downstream evaluation results. Thorough experiments yield improved prior-art baselines, which our method still surpasses; e.g., our overall improvement with MSN ViT-B/16 is 3.9 p.p. for 1-shot learning.
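To make the high-level description concrete, the sketch below illustrates the general idea of ensembling only lightweight projection heads on top of a shared backbone and combining their cross-entropy terms with data-dependent weights. It is a minimal illustration, not the authors' implementation: the module names, head architecture, and the particular per-sample softmax weighting (one plausible scheme for encouraging head diversity) are assumptions made for exposition.

```python
# Minimal PyTorch-style sketch (illustrative, not the paper's code): a shared
# backbone with K projection heads, trained with a data-dependent weighted
# cross-entropy loss aggregated over the heads.
import torch
import torch.nn as nn
import torch.nn.functional as F


class HeadEnsemble(nn.Module):
    """Shared backbone with K independent projection heads (the backbone itself is not ensembled)."""

    def __init__(self, backbone: nn.Module, embed_dim: int, out_dim: int, num_heads: int = 4):
        super().__init__()
        self.backbone = backbone
        self.heads = nn.ModuleList(
            nn.Sequential(nn.Linear(embed_dim, 2048), nn.GELU(), nn.Linear(2048, out_dim))
            for _ in range(num_heads)
        )

    def forward(self, x):
        z = self.backbone(x)                      # one backbone pass, reused by every head
        return [head(z) for head in self.heads]   # list of K projection outputs


def weighted_ensemble_loss(student_logits, teacher_probs, temperature=0.1):
    """Data-dependent weighted cross-entropy across heads.

    For each head, the per-sample cross-entropy against the teacher targets is
    reweighted (here via a softmax over the batch) so that samples where the head
    disagrees more with the teacher receive larger weight; this weighting is an
    assumed example, not the paper's exact scheme.
    """
    total = 0.0
    for s_logits, t_probs in zip(student_logits, teacher_probs):
        log_p = F.log_softmax(s_logits / temperature, dim=-1)
        per_sample_ce = -(t_probs * log_p).sum(dim=-1)       # shape: [batch]
        w = torch.softmax(per_sample_ce.detach(), dim=0)     # data-dependent weights
        total = total + (w * per_sample_ce).sum()
    return total / len(student_logits)
```

In this view, the extra heads add parameters only during pre-training; downstream evaluation can use the unchanged backbone representations, consistent with the abstract's claim of no architectural changes or computational overhead at evaluation time.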