With the recent success of deep neural networks, remarkable progress has been achieved on face recognition. However, collecting large-scale real-world training data for face recognition has turned out to be challenging, especially due to the label noise and privacy issues. Meanwhile, existing face recognition datasets are usually collected from web images, lacking detailed annotations on attributes (e.g., pose and expression), so the influences of different attributes on face recognition have been poorly investigated. In this paper, we address the above-mentioned issues in face recognition using synthetic face images, i.e., SynFace. Specifically, we first explore the performance gap between recent state-of-the-art face recognition models trained with synthetic and real face images. We then analyze the underlying causes behind the performance gap, e.g., the poor intra-class variations and the domain gap between synthetic and real face images. Inspired by this, we devise the SynFace with identity mixup (IM) and domain mixup (DM) to mitigate the above performance gap, demonstrating the great potentials of synthetic data for face recognition. Furthermore, with the controllable face synthesis model, we can easily manage different factors of synthetic face generation, including pose, expression, illumination, the number of identities, and samples per identity. Therefore, we also perform a systematically empirical analysis on synthetic face images to provide some insights on how to effectively utilize synthetic data for face recognition.
翻译:最近,随着深层神经网络的成功,在面部识别方面取得了显著进展。然而,为面部识别而收集大规模真实世界培训数据证明,特别是由于标签噪音和隐私问题,为面部识别而收集的大规模真实世界培训数据具有挑战性。与此同时,现有的面部识别数据集通常从网络图像中收集,缺乏关于特征的详细说明(如脸部和表情),因此,不同属性对面部识别的影响调查不力。在本文件中,我们用合成面部图像(即SynFace)进行面部识别,解决上述问题。具体地说,我们首先探索了最近以合成图像和真实图像培训的最先进的面部识别模型之间的性能差距。我们随后分析了业绩差距背后的根本原因,例如,分类内部变化差以及合成图像与真实图像之间的领域差距。我们为此设计了身份混杂的SynFace(IM)和域组合(DM)来缓解上述业绩差距,展示了合成数据对面部识别的巨大潜力。此外,通过可控面部合成图像模型,我们能够对合成图像的不同面部、合成图像进行系统化分析,我们也可以对合成图像进行系统化分析,包括合成图像的合成模型分析。