In this work, we address the challenging task of long-tailed image recognition. Previous long-tailed recognition methods commonly focus on the data augmentation or re-balancing strategy of the tail classes to give more attention to tail classes during the model training. However, due to the limited training images for tail classes, the diversity of tail class images is still restricted, which results in poor feature representations. In this work, we hypothesize that common latent features among the head and tail classes can be used to give better feature representation. Motivated by this, we introduce a Latent Categories based long-tail Recognition (LCReg) method. Specifically, we propose to learn a set of class-agnostic latent features shared among the head and tail classes. Then, we implicitly enrich the training sample diversity via applying semantic data augmentation to the latent features. Extensive experiments on five long-tailed image recognition datasets demonstrate that our proposed LCReg is able to significantly outperform previous methods and achieve state-of-the-art results.
翻译:在这项工作中,我们处理长尾类图像识别的艰巨任务。 以往长尾类识别方法通常侧重于尾类数据增强或再平衡策略,以便在模型培训期间更多地关注尾类课程。 但是,由于尾类类培训图像有限,尾类图像的多样性仍然受到限制,导致特征表现不佳。 在这项工作中,我们假设头类和尾类的共同潜伏特征可以用来提供更好的特征描述。 受此驱动,我们采用了基于低尾类长尾类识别(LCReg)方法。 具体地说,我们提议学习一组在头类和尾类之间共享的类不可知潜在特征。 然后,我们通过对潜在特征应用语义数据增强来隐含地丰富培训样本的多样性。 对五套长成图像识别数据集进行的广泛实验表明,我们拟议的 LCREg 能够大大超越以往方法,并取得最新结果。