Long-tailed image recognition presents massive challenges to deep learning systems since the imbalance between majority (head) classes and minority (tail) classes severely skews the data-driven deep neural networks. Previous methods tackle with data imbalance from the viewpoints of data distribution, feature space, and model design, etc.In this work, instead of directly learning a recognition model, we suggest confronting the bottleneck of head-to-tail bias before classifier learning, from the previously omitted perspective of balancing label space. To alleviate the head-to-tail bias, we propose a concise paradigm by progressively adjusting label space and dividing the head classes and tail classes, dynamically constructing balance from imbalance to facilitate the classification. With flexible data filtering and label space mapping, we can easily embed our approach to most classification models, especially the decoupled training methods. Besides, we find the separability of head-tail classes varies among different features with different inductive biases. Hence, our proposed model also provides a feature evaluation method and paves the way for long-tailed feature learning. Extensive experiments show that our method can boost the performance of state-of-the-arts of different types on widely-used benchmarks. Code is available at https://github.com/silicx/DLSA.
翻译:长尾图像识别对深层次学习系统提出了巨大的挑战,因为多数(头)类和少数(尾)类之间的不平衡严重扭曲了数据驱动的深神经网络。以往的方法从数据分布、特征空间和模型设计等角度处理数据不平衡问题。 在这项工作中,我们建议从先前忽略的平衡标签空间的角度,在叙级学习之前,面对头到尾偏差的瓶颈。为了减轻头到尾的偏差,我们提出了一个简明的范例,逐步调整标签空间,区分头类和尾类,动态地构建平衡,以方便分类。通过灵活的数据过滤和标签空间绘图,我们可以很容易地将我们的方法嵌入大多数分类模型,特别是分解的培训方法。此外,我们发现头尾班在不同的特征之间有差异,有不同的诱导偏差。因此,我们提议的模型还提供了一个特征评估方法,并为长尾的特征学习铺平了道路。广泛的实验显示,我们的方法可以提高州-州/州/州/州/州/州/州/州级标准。