Fine-grained semantic segmentation of a person's face and head, including facial parts and head components, has progressed a great deal in recent years. However, it remains a challenging task, whereby considering ambiguous occlusions and large pose variations are particularly difficult. To overcome these difficulties, we propose a novel framework termed Mask-FPAN. It uses a de-occlusion module that learns to parse occluded faces in a semi-supervised way. In particular, face landmark localization, face occlusionstimations, and detected head poses are taken into account. A 3D morphable face model combined with the UV GAN improves the robustness of 2D face parsing. In addition, we introduce two new datasets named FaceOccMask-HQ and CelebAMaskOcc-HQ for face paring work. The proposed Mask-FPAN framework addresses the face parsing problem in the wild and shows significant performance improvements with MIOU from 0.7353 to 0.9013 compared to the state-of-the-art on challenging face datasets.
翻译:近些年来,对一个人的脸和头部,包括面部部分和头部部分的精密语解剖取得了很大进展,然而,这仍然是一项具有挑战性的任务,根据这一任务,考虑模糊的排斥和巨大的构成差异尤其困难。为了克服这些困难,我们提议了一个叫作Mask-FPAN的新框架。它使用一种以半监督方式分析隐蔽面部的排除语解模块。特别是,将脸部标志化、面部隔离和检测到的头部姿势考虑在内。一个3D可变形面貌模型与UV GAN相结合,提高了2D面部面部剖面的坚固性。此外,我们引入了两个新的数据集,分别是FaceOcc-HQ和CerebAMask-HQ,用于面部工作。拟议的Mac-FPAN框架处理野生面部的分辨问题,并显示与MIO的性能显著改善,从0.7353到0.9013,而与具有挑战性面部的状态数据集相比,从0.9013到0.9013。</s>