【论文】自训练噪声student模型提高ImageNet分类准确率（Self-training with Noisy Student improves ImageNet classification），谷歌研究科学家Quoc V. Le等 - 专知VIP

会员服务 ·

0

Google · 卡内基梅隆大学 (Carnegie Mellon University) · 自监督学习 · 深度学习 · Qizhe Xie ·

2019 年 11 月 20 日

【论文】自训练噪声student模型提高ImageNet分类准确率（Self-training with Noisy Student improves ImageNet classification），谷歌研究科学家Quoc V. Le等

专知会员服务

专知，提供专业可信的知识分发服务，让认知协作更快更好！

论文题目： Self-training with Noisy Student improves ImageNet classification

论文摘要： 我们提出了一种简单的自我训练方法，在ImageNet上达到87.4%的top-1精度，比目前最先进的需要3.5B弱标记Instagram图像的模型好1.0%。在稳健性测试集上，它将imagnet-A的最高精度从16.6%提高到74.2%，将imagnet-C的平均损坏误差从45.7降低到31.2，并将imagnet-P的平均翻转率从27.8降低到16.1。为了达到这一目的，我们首先在标注的ImageNet图像上训练了一个EfficientNet模型，然后用这个模型作为老师在3亿无标签图像上生成伪标签。然后又训练了一个更大的EfficientNet作为学生student模型，使用的数据则是正确标注图像和伪标注图像的混合数据。这一过程不断迭代，每个新的学生模型作为下一轮的老师模型。在伪标签的生成过程中，老师模型不受噪声干扰，所以生成的伪标注会尽可能逼真。但是在学生模型的学习过程中，我们对数据加入了噪声，使用了诸如数据增强、dropout、随机深度等方法，使得学生模型在从伪标签训练的过程中更加艰难。

作者简介：

Quoc V. Le，谷歌研究科学家，斯坦福大学计算机科学系人工智能实验室博士生。 Qizhe Xie，卡内基梅隆大学机器学习系博士研究生，感兴趣的方向：深度学习、自然语言处理、计算机视觉。等

成为VIP会员查看完整内容

24

相关内容

Google

一家美国的跨国科技企业，致力于互联网搜索、云计算、广告技术等领域，由当时在斯坦福大学攻读理学博士的拉里·佩奇和谢尔盖·布林共同创建。创始之初，Google 官方的公司使命为「整合全球范围的信息，使人人皆可访问并从中受益」。 Google 开发并提供了大量基于互联网的产品与服务，其主要利润来自于 AdWords 等广告服务。

2004 年 8 月 19 日，公司以「GOOG」为代码正式登陆纳斯达克交易所。

【Google】平滑对抗训练，Smooth Adversarial Training

【Google】平滑对抗训练，Smooth Adversarial Training

专知会员服务

49+阅读 · 2020年7月4日

【Google】监督对比学习，Supervised Contrastive Learning

【Google】监督对比学习，Supervised Contrastive Learning

专知会员服务

75+阅读 · 2020年4月24日

【伯克利】最新《深度半监督学习》总述，146页ppt，Semi-Supervised Learning

【伯克利】最新《深度半监督学习》总述，146页ppt，Semi-Supervised Learning

专知会员服务

147+阅读 · 2020年4月11日

【ICML2020投稿论文】用于半监督图像分类的CowMask，Milking CowMask for Semi-Supervised Image Classification

【ICML2020投稿论文】用于半监督图像分类的CowMask，Milking CowMask for Semi-Supervised Image Classification

专知会员服务

29+阅读 · 2020年3月27日

【ICLR2020】用实对二进制卷积训练二进制神经网络，Training Binary Neural Networks with Real-to-Binary Convolutions

【ICLR2020】用实对二进制卷积训练二进制神经网络，Training Binary Neural Networks with Real-to-Binary Convolutions

专知会员服务

26+阅读 · 2020年3月26日

【Google AI】开源NoisyStudent：自监督图像分类

【Google AI】开源NoisyStudent：自监督图像分类

专知会员服务

55+阅读 · 2020年2月18日

重磅！Geoffrey Hinton新论文「视觉表示对比学习简单框架」自监督学习建立新SOTA-ImageNet准确率76.5%

重磅！Geoffrey Hinton新论文「视觉表示对比学习简单框架」自监督学习建立新SOTA-ImageNet准确率76.5%

专知会员服务

33+阅读 · 2020年2月15日

【论文】深度卷积神经网络的ImageNet分类（ImageNet Classification with Deep Convolutional Neural Networks）

【论文】深度卷积神经网络的ImageNet分类（ImageNet Classification with Deep Convolutional Neural Networks）

专知会员服务

14+阅读 · 2020年1月1日

【斯坦福大学】对抗性表征主动学习，Adversarial Representation Active Learning

【斯坦福大学】对抗性表征主动学习，Adversarial Representation Active Learning

专知会员服务

45+阅读 · 2019年12月20日

【用十亿级半监督学习实现最先进图像与视频分类】《Billion-scale semi-supervised learning for state-of-the-art image and video classification | Facebook》

【用十亿级半监督学习实现最先进图像与视频分类】《Billion-scale semi-supervised learning for state-of-the-art image and video classification | Facebook》

专知会员服务

16+阅读 · 2019年10月21日

Google AI再出大杀器！自监督学习ImageNet识别率历史新高87.4%，Jeff Dean点赞Quoc Le新论文

Google AI再出大杀器！自监督学习ImageNet识别率历史新高87.4%，Jeff Dean点赞Quoc Le新论文

专知

6+阅读 · 2019年11月13日

谷歌提出新分类损失函数：将噪声对训练结果影响降到最低

谷歌提出新分类损失函数：将噪声对训练结果影响降到最低

量子位

8+阅读 · 2019年8月28日

如何优化你的图像分类模型效果？

如何优化你的图像分类模型效果？

AI研习社

4+阅读 · 2019年5月26日

280万样本！谷歌开放史上最大分割掩码数据集，开启新一轮挑战赛

280万样本！谷歌开放史上最大分割掩码数据集，开启新一轮挑战赛

极市平台

4+阅读 · 2019年5月10日

10亿级数据规模的半监督图像分类模型，Imagenet测试精度高达81.2％ | 技术头条

10亿级数据规模的半监督图像分类模型，Imagenet测试精度高达81.2％ | 技术头条

AI100

7+阅读 · 2019年5月7日

图像分类算法优化技巧：Bag of Tricks for Image Classification

图像分类算法优化技巧：Bag of Tricks for Image Classification

极市平台

9+阅读 · 2018年12月28日

【干货】Yann Lecun自监督学习指南（附114页Slides全文）

【干货】Yann Lecun自监督学习指南（附114页Slides全文）

GAN生成式对抗网络

94+阅读 · 2018年12月19日

Yann Lecun自监督学习指南（附114页Slides全文下载）

Yann Lecun自监督学习指南（附114页Slides全文下载）

专知

53+阅读 · 2018年12月19日

用这种方法实现无监督端到端图像分类！（附论文）

用这种方法实现无监督端到端图像分类！（附论文）

数据派THU

8+阅读 · 2018年8月10日

IBM新论文|SamplePairing：针对图像处理领域的高效数据增强方式

IBM新论文|SamplePairing：针对图像处理领域的高效数据增强方式

极市平台

16+阅读 · 2018年1月20日

Self-Supervised Learning For Few-Shot Image Classification

Self-Supervised Learning For Few-Shot Image Classification

Arxiv

19+阅读 · 2019年11月14日

Self-training with Noisy Student improves ImageNet classification

Arxiv

15+阅读 · 2019年11月11日

Learning to Propagate Labels: Transductive Propagation Network for Few-shot Learning

Arxiv

7+阅读 · 2019年2月8日

Rethinking ImageNet Pre-training

Arxiv

8+阅读 · 2018年11月21日

Few Shot Learning with Simplex

Few Shot Learning with Simplex

Arxiv

5+阅读 · 2018年7月27日

Document Image Classification with Intra-Domain Transfer Learning and Stacked Generalization of Deep Convolutional Neural Networks

Document Image Classification with Intra-Domain Transfer Learning and Stacked Generalization of Deep Convolutional Neural Networks

Arxiv

3+阅读 · 2018年6月26日

Learning Instance Segmentation by Interaction

Arxiv

6+阅读 · 2018年6月21日

Noise2Noise: Learning Image Restoration without Clean Data

Arxiv

5+阅读 · 2018年3月12日

TernausNet: U-Net with VGG11 Encoder Pre-Trained on ImageNet for Image Segmentation

Arxiv

5+阅读 · 2018年1月17日

Train Once, Test Anywhere: Zero-Shot Learning for Text Classification

Arxiv

4+阅读 · 2017年12月23日

VIP会员

相关主题

卡内基梅隆大学 (Carnegie Mellon University)

自监督学习

相关VIP内容

【Google】平滑对抗训练，Smooth Adversarial Training

【Google】平滑对抗训练，Smooth Adversarial Training

专知会员服务

49+阅读 · 2020年7月4日

【Google】监督对比学习，Supervised Contrastive Learning

【Google】监督对比学习，Supervised Contrastive Learning

专知会员服务

75+阅读 · 2020年4月24日

【伯克利】最新《深度半监督学习》总述，146页ppt，Semi-Supervised Learning

【伯克利】最新《深度半监督学习》总述，146页ppt，Semi-Supervised Learning

专知会员服务

147+阅读 · 2020年4月11日

【ICML2020投稿论文】用于半监督图像分类的CowMask，Milking CowMask for Semi-Supervised Image Classification

【ICML2020投稿论文】用于半监督图像分类的CowMask，Milking CowMask for Semi-Supervised Image Classification

专知会员服务

29+阅读 · 2020年3月27日

【ICLR2020】用实对二进制卷积训练二进制神经网络，Training Binary Neural Networks with Real-to-Binary Convolutions

【ICLR2020】用实对二进制卷积训练二进制神经网络，Training Binary Neural Networks with Real-to-Binary Convolutions

专知会员服务

26+阅读 · 2020年3月26日

【Google AI】开源NoisyStudent：自监督图像分类

【Google AI】开源NoisyStudent：自监督图像分类

专知会员服务

55+阅读 · 2020年2月18日

重磅！Geoffrey Hinton新论文「视觉表示对比学习简单框架」自监督学习建立新SOTA-ImageNet准确率76.5%

重磅！Geoffrey Hinton新论文「视觉表示对比学习简单框架」自监督学习建立新SOTA-ImageNet准确率76.5%

专知会员服务

33+阅读 · 2020年2月15日

【论文】深度卷积神经网络的ImageNet分类（ImageNet Classification with Deep Convolutional Neural Networks）

【论文】深度卷积神经网络的ImageNet分类（ImageNet Classification with Deep Convolutional Neural Networks）

专知会员服务

14+阅读 · 2020年1月1日

【斯坦福大学】对抗性表征主动学习，Adversarial Representation Active Learning

【斯坦福大学】对抗性表征主动学习，Adversarial Representation Active Learning

专知会员服务

45+阅读 · 2019年12月20日

【用十亿级半监督学习实现最先进图像与视频分类】《Billion-scale semi-supervised learning for state-of-the-art image and video classification | Facebook》

【用十亿级半监督学习实现最先进图像与视频分类】《Billion-scale semi-supervised learning for state-of-the-art image and video classification | Facebook》

专知会员服务

16+阅读 · 2019年10月21日

热门VIP内容

开通专知VIP会员享更多权益服务

从社会学实验到行为仿真：理解基于Agent的观点动力学建模思维

中英文版《GPT-5 System Card速览》报告

ACL 2025 | 大模型结构化知识提示的泛化能力研究

【普林斯顿博士论文】大型模型的高效推理

相关资讯

Google AI再出大杀器！自监督学习ImageNet识别率历史新高87.4%，Jeff Dean点赞Quoc Le新论文

Google AI再出大杀器！自监督学习ImageNet识别率历史新高87.4%，Jeff Dean点赞Quoc Le新论文

专知

6+阅读 · 2019年11月13日

谷歌提出新分类损失函数：将噪声对训练结果影响降到最低

谷歌提出新分类损失函数：将噪声对训练结果影响降到最低

量子位

8+阅读 · 2019年8月28日

如何优化你的图像分类模型效果？

如何优化你的图像分类模型效果？

AI研习社

4+阅读 · 2019年5月26日

280万样本！谷歌开放史上最大分割掩码数据集，开启新一轮挑战赛

280万样本！谷歌开放史上最大分割掩码数据集，开启新一轮挑战赛

极市平台

4+阅读 · 2019年5月10日

10亿级数据规模的半监督图像分类模型，Imagenet测试精度高达81.2％ | 技术头条

10亿级数据规模的半监督图像分类模型，Imagenet测试精度高达81.2％ | 技术头条

AI100

7+阅读 · 2019年5月7日

图像分类算法优化技巧：Bag of Tricks for Image Classification

图像分类算法优化技巧：Bag of Tricks for Image Classification

极市平台

9+阅读 · 2018年12月28日

【干货】Yann Lecun自监督学习指南（附114页Slides全文）

【干货】Yann Lecun自监督学习指南（附114页Slides全文）

GAN生成式对抗网络

94+阅读 · 2018年12月19日

Yann Lecun自监督学习指南（附114页Slides全文下载）

Yann Lecun自监督学习指南（附114页Slides全文下载）

专知

53+阅读 · 2018年12月19日

用这种方法实现无监督端到端图像分类！（附论文）

用这种方法实现无监督端到端图像分类！（附论文）

数据派THU

8+阅读 · 2018年8月10日

IBM新论文|SamplePairing：针对图像处理领域的高效数据增强方式

IBM新论文|SamplePairing：针对图像处理领域的高效数据增强方式

极市平台

16+阅读 · 2018年1月20日

相关论文

Self-Supervised Learning For Few-Shot Image Classification

Self-Supervised Learning For Few-Shot Image Classification

Arxiv

19+阅读 · 2019年11月14日

Self-training with Noisy Student improves ImageNet classification

Arxiv

15+阅读 · 2019年11月11日

Learning to Propagate Labels: Transductive Propagation Network for Few-shot Learning

Arxiv

7+阅读 · 2019年2月8日

Rethinking ImageNet Pre-training

Arxiv

8+阅读 · 2018年11月21日

Few Shot Learning with Simplex

Few Shot Learning with Simplex

Arxiv

5+阅读 · 2018年7月27日

Document Image Classification with Intra-Domain Transfer Learning and Stacked Generalization of Deep Convolutional Neural Networks

Document Image Classification with Intra-Domain Transfer Learning and Stacked Generalization of Deep Convolutional Neural Networks

Arxiv

3+阅读 · 2018年6月26日

Learning Instance Segmentation by Interaction

Arxiv

6+阅读 · 2018年6月21日

Noise2Noise: Learning Image Restoration without Clean Data

Arxiv

5+阅读 · 2018年3月12日

TernausNet: U-Net with VGG11 Encoder Pre-Trained on ImageNet for Image Segmentation

Arxiv

5+阅读 · 2018年1月17日

Train Once, Test Anywhere: Zero-Shot Learning for Text Classification

Arxiv

4+阅读 · 2017年12月23日

微信扫码咨询专知VIP会员