Real-world data often follows a long-tailed distribution, which makes the performance of existing classification algorithms degrade heavily. A key issue is that samples in tail categories fail to depict their intra-class diversity. Humans can imagine a sample in new poses, scenes, and view angles with their prior knowledge even if it is the first time to see this category. Inspired by this, we propose a novel reasoning-based implicit semantic data augmentation method to borrow transformation directions from other classes. Since the covariance matrix of each category represents the feature transformation directions, we can sample new directions from similar categories to generate definitely different instances. Specifically, the long-tailed distributed data is first adopted to train a backbone and a classifier. Then, a covariance matrix for each category is estimated, and a knowledge graph is constructed to store the relations of any two categories. Finally, tail samples are adaptively enhanced via propagating information from all the similar categories in the knowledge graph. Experimental results on CIFAR-100-LT, ImageNet-LT, and iNaturalist 2018 have demonstrated the effectiveness of our proposed method compared with the state-of-the-art methods.
翻译:现实世界数据往往遵循长期的分布法,使得现有分类算法的性能严重退化。一个关键问题是尾类样本无法描述其阶级内部的多样性。人类可以想象出一个样本,以新的外观、场景和视觉角度,即使这是第一次看到这一类别。受此启发,我们提出了一个基于推理的隐含语义数据增强方法,以从其他类别中借用转换方向。由于每个类别的共变矩阵代表特征转换方向,我们可以从类似类别中抽取新的方向,以产生明显不同的例子。具体地说,长尾类分布数据首先用于培训骨干和分类者。然后,估计了每种类别的共变式矩阵,并构建了一个知识图以储存任何两类的关系。最后,通过传播知识图表中所有同类类别的信息,尾样经过适应性地得到加强。CIFAR-100-LT、图像网-LT和iNatallist 2018的实验结果证明了我们拟议方法与最新方法相比的有效性。