Curvature-Balanced Feature Manifold Learning for Long-Tailed Classification (Curvature-Balanced Feature Manifold Learning for Long-Tailed Classification)

To address the challenges of long-tailed classification, researchers have proposed several approaches to reduce model bias, most of which assume that classes with few samples are weak classes. However, recent studies have shown that tail classes are not always hard to learn, and model bias has been observed on sample-balanced datasets, suggesting the existence of other factors that affect model bias. In this work, we systematically propose a series of geometric measurements for perceptual manifolds in deep neural networks, and then explore the effect of the geometric characteristics of perceptual manifolds on classification difficulty and how learning shapes the geometric characteristics of perceptual manifolds. An unanticipated finding is that the correlation between the class accuracy and the separation degree of perceptual manifolds gradually decreases during training, while the negative correlation with the curvature gradually increases, implying that curvature imbalance leads to model bias. Therefore, we propose curvature regularization to facilitate the model to learn curvature-balanced and flatter perceptual manifolds. Evaluations on multiple long-tailed and non-long-tailed datasets show the excellent performance and exciting generality of our approach, especially in achieving significant performance improvements based on current state-of-the-art techniques. Our work opens up a geometric analysis perspective on model bias and reminds researchers to pay attention to model bias on non-long-tailed and even sample-balanced datasets. The code and model will be made public.

翻译：面向长尾分类的曲率平衡特征流形学习为了解决长尾分类的挑战，研究人员提出了减少模型偏差的几种方法，其中大多数方法假定样本较少的类别是弱类别。然而，最近的研究表明，尾部类别并不总是难以学习的，而在样本平衡的数据集上观察到模型偏差，表明存在其他影响模型偏差的因素。在这项工作中，我们系统地提出了一系列深度神经网络感知流形的几何度量，并探究了感知流形的几何特征对分类难度的影响，以及学习如何塑造感知流形的几何特征。一个出乎意料的发现是，类别精度与感知流形的分离度之间的相关性在训练过程中逐渐降低，而与曲率的负相关性逐渐增加，这表明曲率不平衡导致了模型偏差。因此，我们提出了曲率正则化来促进模型学习曲率平衡和更平整的感知流形。在多个长尾和非长尾数据集上的评估表明，我们的方法具有出色的性能和令人兴奋的通用性，尤其是在当前最先进技术的基础上实现了显著的性能提升。我们的工作为模型偏差开辟了几何分析视角，并提醒研究人员关注在非长尾甚至样本平衡的数据集上的模型偏差。代码和模型将公开发布。