Decoupling representation learning and classifier learning has been shown to be effective for classification with long-tailed data. There are two main ingredients in constructing a decoupled learning scheme: 1) how to train the feature extractor so that it provides generalizable representations, and 2) how to re-train the classifier so that it forms proper decision boundaries while handling the class imbalance in long-tailed data. In this work, we first apply Stochastic Weight Averaging (SWA), an optimization technique for improving the generalization of deep neural networks, to obtain better-generalizing feature extractors for long-tailed classification. We then propose a novel classifier re-training algorithm based on stochastic representations obtained from SWA-Gaussian, a Gaussian-perturbed SWA, together with a self-distillation strategy that harnesses these diverse stochastic representations, guided by uncertainty estimates, to build more robust classifiers. Extensive experiments on the CIFAR10/100-LT, ImageNet-LT, and iNaturalist-2018 benchmarks show that our method improves upon previous methods in both prediction accuracy and uncertainty estimation.
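As a minimal NumPy sketch (not the authors' implementation), the two ingredients named above can be illustrated as follows: SWA maintains a running average of weight snapshots collected along the SGD trajectory, and SWA-Gaussian (SWAG) additionally tracks the running second moment to fit a diagonal Gaussian over weights, from which stochastic weight samples, and hence stochastic representations, can be drawn. All names and the toy snapshots below are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

def swa_update(mean, sq_mean, weights, n):
    """Update running first and second moments with the (n+1)-th weight snapshot."""
    new_mean = (mean * n + weights) / (n + 1)
    new_sq_mean = (sq_mean * n + weights ** 2) / (n + 1)
    return new_mean, new_sq_mean

def swag_sample(mean, sq_mean, eps=1e-8):
    """Draw one weight sample from the diagonal SWAG Gaussian N(mean, diag(var))."""
    var = np.clip(sq_mean - mean ** 2, eps, None)  # elementwise variance estimate
    return mean + np.sqrt(var) * rng.standard_normal(mean.shape)

# Toy "weight snapshots" standing in for checkpoints collected along SGD.
snapshots = [np.array([1.0, 2.0]), np.array([1.2, 1.8]), np.array([0.8, 2.2])]
mean = np.zeros(2)
sq_mean = np.zeros(2)
for n, w in enumerate(snapshots):
    mean, sq_mean = swa_update(mean, sq_mean, w, n)

print(mean)  # SWA solution: elementwise average of the snapshots -> [1. 2.]
sample = swag_sample(mean, sq_mean)  # one stochastic weight draw for the extractor
print(sample.shape)
```

In the paper's setting, each such weight sample induces a different (stochastic) representation of the same input, which the classifier re-training and self-distillation stages can exploit.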
Title: Decoupled Training for Long-Tailed Classification with Stochastic Representations