Most existing state-of-the-art video classification methods assume that the training data obey a uniform distribution. However, video data in the real world typically exhibit an imbalanced long-tailed class distribution, resulting in a model bias towards head class and relatively low performance on tail class. While the current long-tailed classification methods usually focus on image classification, adapting it to video data is not a trivial extension. We propose an end-to-end multi-expert distribution calibration method to address these challenges based on two-level distribution information. The method jointly considers the distribution of samples in each class (intra-class distribution) and the overall distribution of diverse data (inter-class distribution) to solve the issue of imbalanced data under long-tailed distribution. By modeling the two-level distribution information, the model can jointly consider the head classes and the tail classes and significantly transfer the knowledge from the head classes to improve the performance of the tail classes. Extensive experiments verify that our method achieves state-of-the-art performance on the long-tailed video classification task.
翻译:多数现有最先进的视频分类方法假定培训数据符合统一分布。然而,现实世界中的视频数据通常呈现不平衡的长尾类分布,导致对头类的模范偏向,尾类的性能相对较低。虽然目前的长尾类分类方法通常侧重于图像分类,但将其调整为视频数据并不是一个微不足道的延伸。我们根据两级分布信息提出了一个端到端的多专家分配校准方法,以应对这些挑战。该方法共同考虑每个类的样本分布(类内部分布)和不同数据的总体分布(类间分布),以解决长尾类分布中的不平衡数据问题。模型通过建模两个层次的分布信息,可以共同考虑头类和尾类,并大量从头类中转让知识,以提高尾类的性能。广泛的实验证实我们的方法在长尾类视频分类工作中取得了最新业绩。