Recovery of a 3D head model including the complete face and hair regions is still a challenging problem in computer vision and graphics. In this paper, we consider this problem using only a few multi-view portrait images as input. Previous multi-view stereo methods that have been based, either on optimization strategies or deep learning techniques, suffer from low-frequency geometric structures such as unclear head structures and inaccurate reconstruction in hair regions. To tackle this problem, we propose a prior-guided implicit neural rendering network. Specifically, we model the head geometry with a learnable signed distance field (SDF) and optimize it via an implicit differentiable renderer with the guidance of some human head priors, including the facial prior knowledge, head semantic segmentation information and 2D hair orientation maps. The utilization of these priors can improve the reconstruction accuracy and robustness, leading to a high-quality integrated 3D head model. Extensive ablation studies and comparisons with state-of-the-art methods demonstrate that our method can generate high-fidelity 3D head geometries with the guidance of these priors.
翻译:包括完整的脸部和毛发区域在内的3D头型模型的恢复仍然是计算机视觉和图形中一个具有挑战性的问题。 在本文中,我们仅使用少数多视图肖像图像作为投入来看待这一问题。以前基于优化战略或深层学习技术的多视图立体法受到低频几何结构的影响,如头部结构不明和毛发区域重建不准确。为了解决这个问题,我们提议了先导的隐性神经造影网络。具体地说,我们用可学习的签名距离字段(SDF)来模拟头部几何方法,并通过一个隐含的可辨别成像器来优化它,并遵循某些人类头部前科的指导,包括面部先前的知识、头部语分解信息以及2D发型方向图。这些前述的利用可以提高重建的准确性和稳健性,从而形成高质量的3D综合头型模型。 广泛进行对比研究,并与国家技术方法进行比较,表明我们的方法可以产生高不直观的3D头形地形。