Occlusions often occur in face images in the wild, troubling face-related tasks such as landmark detection, 3D reconstruction, and face recognition. It is beneficial to extract face regions from unconstrained face images accurately. However, current face segmentation datasets suffer from small data volumes, few occlusion types, low resolution, and imprecise annotation, limiting the performance of data-driven-based algorithms. This paper proposes a novel face occlusion dataset with manually labeled face occlusions from the CelebA-HQ and the internet. The occlusion types cover sunglasses, spectacles, hands, masks, scarfs, microphones, etc. To the best of our knowledge, it is by far the largest and most comprehensive face occlusion dataset. Combining it with the attribute mask in CelebAMask-HQ, we trained a straightforward face segmentation model but obtained SOTA performance, convincingly demonstrating the effectiveness of the proposed dataset.
翻译:在野外的面部图像中往往会出现与面部有关的封闭现象,如里程碑探测、3D重建以及面部识别等,这有利于准确地从未受限制的面部图像中提取面部区域。然而,目前的面部分割数据集存在数据量小、隔离类型少、分辨率低和注解不精确的情况,限制了数据驱动算法的性能。本文提出一个新的面部隔离数据集,由CelebA-HQ和互联网上人工标注的面部隔离现象组成。隐蔽类型包括太阳镜、眼镜、手、面罩、围巾、麦克风等。据我们所知,它远为最大和最全面的面部隔离数据集。将其与CelebAMsk-HQ的属性掩码结合起来,我们训练了一个直接的面部隔离模型,但获得了SOTA的性能,令人信服地展示了拟议数据集的有效性。