In deep metric learning (DML), high-level input data are represented in a lower-level representation (embedding) space, such that samples from the same class are mapped close together, while samples from disparate classes are mapped further apart. In this lower-level representation, only a single inference sample from each known class is required to discriminate between classes accurately. The features a DML model uses to discriminate between classes and the importance of each feature in the training process are unknown. To investigate this, this study takes embeddings trained to discriminate faces (identities) and uses unsupervised clustering to identify the features involved in facial identity discrimination by examining their representation within the embedded space. This study is split into two cases; intra class sub-discrimination, where attributes that differ between a single identity are considered; such as beards and emotions; and extra class sub-discrimination, where attributes which differ between different identities/people, are considered; such as gender, skin tone and age. In the intra class scenario, the inference process distinguishes common attributes between single identities, achieving 90.0\% and 76.0\% accuracy for beards and glasses, respectively. The system can also perform extra class sub-discrimination with a high accuracy rate, notably 99.3\%, 99.3\% and 94.1\% for gender, skin tone, and age, respectively.
翻译:在深度度量学习(DML)中,高级输入数据被表示为更低级的表示(嵌入)空间,使得来自同一类别的样本相互靠近,而来自不同类别的样本相互远离。在这个较低的表示空间中,只需要一个已知类别的单个推理样本即可在类别之间进行准确区分。DML模型用于区分类别的特征以及每个特征在训练过程中的重要性是未知的。为了调查这一点,本研究采用为识别面部(身份)而训练的嵌入,并使用无监督聚类来识别与面部身份识别相关的特征,通过考察它们在嵌入空间内的表示。该研究分为两种情况:类内子判别和类外子判别,在前者中,考虑在单个身份之间不同的属性,如胡子和情绪;在后者中,则考虑诸如性别、肤色和年龄等不同身份/人之间的属性。在内部类别的情况下,推理过程区分单个身份之间的共同属性,实现了90.0%和76.0%的胡子和眼镜识别准确率。该系统还可以高精度率地执行类外子判别,尤其是对于性别、肤色和年龄分别达到了99.3%、99.3%和94.1%的高准确率。