为更有效的半监督学习选择标签 (Unsupervised Selective Labeling for More Effective Semi-Supervised Learning) - 专知论文

会员服务 ·

0

标注 · Learning · SSL · 未标记 · 无监督 ·

2022 年 9 月 12 日

Unsupervised Selective Labeling for More Effective Semi-Supervised Learning

翻译：为更有效的半监督学习选择标签

Xudong Wang,Long Lian,Stella X. Yu

from arxiv, Accepted by ECCV 2022

Given an unlabeled dataset and an annotation budget, we study how to selectively label a fixed number of instances so that semi-supervised learning (SSL) on such a partially labeled dataset is most effective. We focus on selecting the right data to label, in addition to usual SSL's propagating labels from labeled data to the rest unlabeled data. This instance selection task is challenging, as without any labeled data we do not know what the objective of learning should be. Intuitively, no matter what the downstream task is, instances to be labeled must be representative and diverse: The former would facilitate label propagation to unlabeled data, whereas the latter would ensure coverage of the entire dataset. We capture this idea by selecting cluster prototypes, either in a pretrained feature space, or along with feature optimization, both without labels. Our unsupervised selective labeling consistently improves SSL methods over state-of-the-art active learning given labeled data, by 8 to 25 times in label efficiency. For example, it boosts FixMatch by 10% (14%) in accuracy on CIFAR-10 (ImageNet-1K) with 0.08% (0.2%) labeled data, demonstrating that small computation spent on selecting what data to label brings significant gain especially under a low annotation budget. Our work sets a new standard for practical and efficient SSL.

翻译：鉴于一个未贴标签的数据集和注释预算,我们研究如何有选择地标签固定的事例数量,使半监督的学习(SSL)在这样一个部分标签的数据集上最为有效。我们注重选择正确的标签数据。除了通常的SSL将标签标签标签从标签数据向其余未贴标签的数据传播出去之外,我们注重选择正确的标签数据。这个实例选择任务具有挑战性,因为没有标签的数据,我们不知道学习的目标是什么。直觉地说,不管下游任务是什么,标签必须具有代表性和多样性:前者将标签标签传播到未贴标签的数据中,而后者将确保整个数据集的覆盖。我们通过在没有标签的情况下在事先训练的功能空间或与功能优化一起选择集成原型来捕捉这一想法。我们未经监督的选择性标签持续改进了SLSL方法,而不是根据标签效率8至25倍。例如,它能将固定的Match 10% (14 %) 提升到不切实际的S-10 标签的精确度, 特别是根据已花费的S-10 的标签,将多少次(I) 的计算结果带来一个相当的精确性的数据。

0

相关内容

【2022新书】高效深度学习，Efficient Deep Learning Book

【2022新书】高效深度学习，Efficient Deep Learning Book

专知会员服务

125+阅读 · 2022年4月21日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

67页PPT【ML+气象】使用机器学习技术对季节和次季节研究和预测，Use of Machine Learning Techniques for Seasonal and Subseasonal Studies and Predictions

67页PPT【ML+气象】使用机器学习技术对季节和次季节研究和预测，Use of Machine Learning Techniques for Seasonal and Subseasonal Studies and Predictions

专知会员服务

19+阅读 · 2022年3月4日

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

局部学习的特征选择：Local-Learning-Based Feature Selection

局部学习的特征选择：Local-Learning-Based Feature Selection

我爱读PAMI

14+阅读 · 2019年9月20日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

AAAI2019论文抢鲜看！48篇自然语言处理/计算机视觉/机器学习最新接受论文！

AAAI2019论文抢鲜看！48篇自然语言处理/计算机视觉/机器学习最新接受论文！

专知

11+阅读 · 2018年11月4日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

低温环境下光纤光栅传感器的抗疲劳设计

国家自然科学基金

0+阅读 · 2015年12月31日

煤粉非均相MILD燃烧及燃料型NOx生成机理研究

国家自然科学基金

0+阅读 · 2015年12月31日

基于Aurivillius-Sillenite结构光催化材料的性能调控研究

国家自然科学基金

0+阅读 · 2014年12月31日

氟代金属酞菁半导体材料的合成、性能与薄膜器件研究

国家自然科学基金

0+阅读 · 2012年12月31日

金属与锗接触界面微结构改性及势垒高度调控机理研究

国家自然科学基金

0+阅读 · 2011年12月31日

高维系统混沌特性及其符号动力学同步方法的研究

国家自然科学基金

1+阅读 · 2011年12月31日

真空紫外激发的稀土纳米荧光材料效率的研究

国家自然科学基金

0+阅读 · 2009年12月31日

EGFR2单抗Herceptin修饰紫杉醇纳米胶束联合Survivin基因沉默靶向治疗鼻咽癌的实验研究

国家自然科学基金

0+阅读 · 2009年12月31日

纳米金属多层膜延性和疲劳性能的约束效应研究

国家自然科学基金

0+阅读 · 2009年12月31日

考虑制造误差的滑动轴承转子系统非线性动力学分析

国家自然科学基金

0+阅读 · 2008年12月31日

Evaluating and Crafting Datasets Effective for Deep Learning With Data Maps

Arxiv

0+阅读 · 2022年10月24日

Active Learning of Discrete-Time Dynamics for Uncertainty-Aware Model Predictive Control

Arxiv

0+阅读 · 2022年10月23日

Targeted active learning for probabilistic models

Arxiv

0+阅读 · 2022年10月21日

An Adaptive Neighborhood Partition Full Conditional Mutual Information Maximization Method for Feature Selection

Arxiv

0+阅读 · 2022年10月21日

Debiased Self-Training for Semi-Supervised Learning

Arxiv

0+阅读 · 2022年10月21日

Automatic Document Selection for Efficient Encoder Pretraining

Arxiv

0+阅读 · 2022年10月20日

Self-Supervised Representation Learning for CAD

Arxiv

0+阅读 · 2022年10月19日

Adaptive Consistency Regularization for Semi-Supervised Transfer Learning

Arxiv

23+阅读 · 2021年3月3日

A Collective Learning Framework to Boost GNN Expressiveness

A Collective Learning Framework to Boost GNN Expressiveness

Arxiv

20+阅读 · 2020年3月26日

A Simple Framework for Contrastive Learning of Visual Representations

Arxiv

21+阅读 · 2020年2月13日

VIP会员

文章信息

相关主题

相关VIP内容

【2022新书】高效深度学习，Efficient Deep Learning Book

【2022新书】高效深度学习，Efficient Deep Learning Book

专知会员服务

125+阅读 · 2022年4月21日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

67页PPT【ML+气象】使用机器学习技术对季节和次季节研究和预测，Use of Machine Learning Techniques for Seasonal and Subseasonal Studies and Predictions

67页PPT【ML+气象】使用机器学习技术对季节和次季节研究和预测，Use of Machine Learning Techniques for Seasonal and Subseasonal Studies and Predictions

专知会员服务

19+阅读 · 2022年3月4日

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

检索增强生成（RAG）技术，261页slides

美联参会指南-联合规划与执行概述及政策框架 | 32页

从DeepSeek-R1学到的三个核心经验

大规模视觉模型中的提示式适配：综述

相关资讯

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

局部学习的特征选择：Local-Learning-Based Feature Selection

局部学习的特征选择：Local-Learning-Based Feature Selection

我爱读PAMI

14+阅读 · 2019年9月20日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

AAAI2019论文抢鲜看！48篇自然语言处理/计算机视觉/机器学习最新接受论文！

AAAI2019论文抢鲜看！48篇自然语言处理/计算机视觉/机器学习最新接受论文！

专知

11+阅读 · 2018年11月4日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

相关论文

Evaluating and Crafting Datasets Effective for Deep Learning With Data Maps

Arxiv

0+阅读 · 2022年10月24日

Active Learning of Discrete-Time Dynamics for Uncertainty-Aware Model Predictive Control

Arxiv

0+阅读 · 2022年10月23日

Targeted active learning for probabilistic models

Arxiv

0+阅读 · 2022年10月21日

An Adaptive Neighborhood Partition Full Conditional Mutual Information Maximization Method for Feature Selection

Arxiv

0+阅读 · 2022年10月21日

Debiased Self-Training for Semi-Supervised Learning

Arxiv

0+阅读 · 2022年10月21日

Automatic Document Selection for Efficient Encoder Pretraining

Arxiv

0+阅读 · 2022年10月20日

Self-Supervised Representation Learning for CAD

Arxiv

0+阅读 · 2022年10月19日

Adaptive Consistency Regularization for Semi-Supervised Transfer Learning

Arxiv

23+阅读 · 2021年3月3日

A Collective Learning Framework to Boost GNN Expressiveness

A Collective Learning Framework to Boost GNN Expressiveness

Arxiv

20+阅读 · 2020年3月26日

A Simple Framework for Contrastive Learning of Visual Representations

Arxiv

21+阅读 · 2020年2月13日

相关基金

低温环境下光纤光栅传感器的抗疲劳设计

国家自然科学基金

0+阅读 · 2015年12月31日

煤粉非均相MILD燃烧及燃料型NOx生成机理研究

国家自然科学基金

0+阅读 · 2015年12月31日

基于Aurivillius-Sillenite结构光催化材料的性能调控研究

国家自然科学基金

0+阅读 · 2014年12月31日

氟代金属酞菁半导体材料的合成、性能与薄膜器件研究

国家自然科学基金

0+阅读 · 2012年12月31日

金属与锗接触界面微结构改性及势垒高度调控机理研究

国家自然科学基金

0+阅读 · 2011年12月31日

高维系统混沌特性及其符号动力学同步方法的研究

国家自然科学基金

1+阅读 · 2011年12月31日

真空紫外激发的稀土纳米荧光材料效率的研究

国家自然科学基金

0+阅读 · 2009年12月31日

EGFR2单抗Herceptin修饰紫杉醇纳米胶束联合Survivin基因沉默靶向治疗鼻咽癌的实验研究

国家自然科学基金

0+阅读 · 2009年12月31日

纳米金属多层膜延性和疲劳性能的约束效应研究

国家自然科学基金

0+阅读 · 2009年12月31日

考虑制造误差的滑动轴承转子系统非线性动力学分析

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员