In this paper, we evaluate the effects of occlusions in the performance of a face recognition pipeline that uses a ResNet backbone. The classifier was trained on a subset of the CelebA-HQ dataset containing 5,478 images from 307 classes, to achieve top-1 error rate of 17.91%. We designed 8 different occlusion masks which were applied to the input images. This caused a significant drop in the classifier performance: its error rate for each mask became at least two times worse than before. In order to increase robustness under occlusions, we followed two approaches. The first is image inpainting using the pre-trained pluralistic image completion network. The second is Cutmix, a regularization strategy consisting of mixing training images and their labels using rectangular patches, making the classifier more robust against input corruptions. Both strategies revealed effective and interesting results were observed. In particular, the Cutmix approach makes the network more robust without requiring additional steps at the application time, though its training time is considerably longer. Our datasets containing the different occlusion masks as well as their inpainted counterparts are made publicly available to promote research on the field.
翻译:在本文中,我们评估了使用 ResNet 主干网的面部识别管道性能中的隔离效应。 分类器在包含 307 类5 478 图像的CelebA- HQ 数据集子集上接受了培训, 以达到17.91%的顶层-1误差率。 我们设计了8个不同的封闭面罩, 用于输入图像。 这导致分类器性能显著下降: 每个面罩的误差率至少比以前差2倍。 为了提高隔离下的稳健性, 我们遵循了两种方法。 第一个是使用预先训练的多元图像完成网络的图像涂漆。 第二个是 Cutmix, 由使用矩形补丁混合培训图像及其标签组成的正规化战略, 使分类器更有力地防止输入的腐败。 这两种策略都揭示了有效和有趣的结果。 特别是, 使用Cutmix 方法使得网络在应用时间不需要额外步骤的情况下更加稳健。 尽管培训时间要长得多。 我们的数据集包含不同的隐蔽面罩, 以及其内侧侧侧对等的对口单位公开进行研究。