Most classification models treat all misclassifications equally. However, different classes may be related, and these hierarchical relationships must be considered in some classification problems. These problems can be addressed by using hierarchical information during training. Unfortunately, this information is not available for all datasets. Many classification-based metric learning methods use class representatives in embedding space to represent different classes. The relationships among the learned class representatives can then be used to estimate class hierarchical structures. If we have a predefined class hierarchy, the learned class representatives can be assessed to determine whether the metric learning model learned semantic distances that match our prior knowledge. In this work, we train a softmax classifier and three metric learning models with several training options on benchmark and real-world datasets. In addition to the standard classification accuracy, we evaluate the hierarchical inference performance by inspecting learned class representatives and the hierarchy-informed performance, i.e., the classification performance, and the metric learning performance by considering predefined hierarchical structures. Furthermore, we investigate how the considered measures are affected by various models and training options. When our proposed ProxyDR model is trained without using predefined hierarchical structures, the hierarchical inference performance is significantly better than that of the popular NormFace model. Additionally, our model enhances some hierarchy-informed performance measures under the same training options. We also found that convolutional neural networks (CNNs) with random weights correspond to the predefined hierarchies better than random chance.
翻译:大多数分类模式处理所有的分类错误。 但是,不同的类别可能相互关联,而且必须在某些分类问题中考虑这些等级关系。这些问题可以通过在培训过程中使用等级信息加以解决。 不幸的是,并非所有数据集都可获得这种信息。许多基于分类的衡量学习方法都使用班级代表来嵌入空间以代表不同类别。然后,可以使用学习的班级代表之间的关系来估计等级结构。如果我们有一个预先定义的等级结构,那么,可以评估学习的班级代表,以确定衡量标准学习模式所学的语义距离是否与我们先前的知识相匹配。在这项工作中,我们培训一个软式的分类器和三个衡量学习模式,在基准和现实世界数据集方面有若干培训选项。除了标准分类准确性外,我们通过检查学习的班级代表来评估等级推论表现,以及了解等级结构的绩效,即分类性能,以及通过考虑预先定义的等级结构,我们研究考虑的计量尺度是否受到各种模式和培训选项的影响。当我们提议的SproxDR模型在不使用事先定义的等级结构结构结构和真实性数据的基础上加以培训时,我们所选择的等级结构中的等级结构会大大改进。 我们的等级结构中的等级性能提高我们所发现的等级结构的成绩。