Recently, different researchers have found that the gallery composition of a face database can induce performance differentials to facial identification systems in which a probe image is compared against up to all stored reference images to reach a biometric decision. This negative effect is referred to as "watchlist imbalance effect". In this work, we present a method to theoretically estimate said effect for a biometric identification system given its verification performance across demographic groups and the composition of the used gallery. Further, we report results for identification experiments on differently composed demographic subsets, i.e. females and males, of the public academic MORPH database using the open-source ArcFace face recognition system. It is shown that the database composition has a huge impact on performance differentials in biometric identification systems, even if performance differentials are less pronounced in the verification scenario. This study represents the first detailed analysis of the watchlist imbalance effect which is expected to be of high interest for future research in the field of facial recognition.
翻译:最近,不同研究人员发现,面部数据库的画廊构成可导致面部识别系统出现性能差异,将探测图像与所有存储的参考图像进行比较,以得出生物鉴别决定。这种负面效应被称为“观察列表不平衡效应 ” 。 在这项工作中,我们提出了一个方法,根据生物鉴别系统的跨人口组的核查性能和所用画廊的构成情况,从理论上估算生物鉴别系统的上述效果。此外,我们还报告了使用开放源的ArcFace面部识别系统对公共学术MORPH数据库中不同构成的人口子集(即女性和男性)进行身份验证实验的结果。 结果表明,数据库的构成对生物鉴别系统的性能差异有着巨大影响,即便在核查设想中表现差异不大明显。 这项研究是对观察列表不平衡效应的首次详细分析,预计将对今后在面部识别领域的研究具有高度兴趣。