Fine-grained semantic segmentation of a person's face and head, including facial parts and head components, has progressed a great deal in recent years. However, it remains a challenging task, whereby considering ambiguous occlusions and large pose variations are particularly difficult. To overcome these difficulties, we propose a novel framework termed Mask-FPAN. It uses a de-occlusion module that learns to parse occluded faces in a semi-supervised way. In particular, face landmark localization, face occlusionstimations, and detected head poses are taken into account. A 3D morphable face model combined with the UV GAN improves the robustness of 2D face parsing. In addition, we introduce two new datasets named FaceOccMask-HQ and CelebAMaskOcc-HQ for face paring work. The proposed Mask-FPAN framework addresses the face parsing problem in the wild and shows significant performance improvements with MIOU from 0.7353 to 0.9013 compared to the state-of-the-art on challenging face datasets.
翻译:人的面部和头部,包括面部各部分和头部构件的细粒度语义分割在最近几年已取得了很大进展。但是,考虑到挡住的面部以及很大的姿势变化,这仍然是一个具有挑战性的任务。为了克服这些困难,我们提出了一种新的框架,称为 Mask-FPAN。 它使用了一个去遮挡模块,以半监督的方式学习解析被遮挡的面部。 特别地,考虑到面部landmark定位,面部遮挡估计和检测到的头部姿势。 结合UV GAN的3D可形变面部模型提高了2D面部解析的鲁棒性。此外,我们还推出了两个新数据集,名为FaceOccMask-HQ和CelebAMaskOcc-HQ,用于人脸解析工作。提出的Mask-FPAN框架解决了野外人脸分割问题,并在具有挑战性的人脸数据集上显示了显著的性能改善,MIOU从0.7353提高到0.9013,相对于最先进技术的结果。