Over the past years, the main research innovations in face recognition focused on training deep neural networks on large-scale identity-labeled datasets using variations of multi-class classification losses. However, many of these datasets are retreated by their creators due to increased privacy and ethical concerns. Very recently, privacy-friendly synthetic data has been proposed as an alternative to privacy-sensitive authentic data to comply with privacy regulations and to ensure the continuity of face recognition research. In this paper, we propose an unsupervised face recognition model based on unlabeled synthetic data (USynthFace). Our proposed USynthFace learns to maximize the similarity between two augmented images of the same synthetic instance. We enable this by a large set of geometric and color transformations in addition to GAN-based augmentation that contributes to the USynthFace model training. We also conduct numerous empirical studies on different components of our USynthFace. With the proposed set of augmentation operations, we proved the effectiveness of our USynthFace in achieving relatively high recognition accuracies using unlabeled synthetic data.
翻译:过去几年来,面对识别的主要研究创新侧重于利用多级分类损失的变异,对大型身份标签数据集的深神经网络进行培训。然而,由于隐私和伦理问题增多,许多这类数据集被创建者退缩。最近,提出了方便隐私的合成数据,作为隐私敏感真实数据的替代办法,以遵守隐私条例,并确保面貌识别研究的连续性。在本文件中,我们提出了一个基于无标签合成数据(USynthFace)的未经监督的面孔识别模型。我们提议的USynthFace学会了尽可能扩大同一合成实例中两种强化图像之间的相似性。我们除了通过有助于USynthFace模型培训的基于GAN的增强外,还促成大量地貌和颜色的转换。我们还就我们的USynthFace模型的不同组成部分进行了许多实证研究。我们提议的增强操作组证明了我们的USynthFace在利用未贴标签合成数据实现相对高的识别度理解方面的有效性。