Reconstructing high-fidelity 3D head avatars is crucial in various applications such as virtual reality. The pioneering methods reconstruct realistic head avatars with Neural Radiance Fields (NeRF), which have been limited by training and rendering speed. Recent methods based on 3D Gaussian Splatting (3DGS) significantly improve the efficiency of training and rendering. However, the surface inconsistency of 3DGS results in subpar geometric accuracy; later, 2DGS uses 2D surfels to enhance geometric accuracy at the expense of rendering fidelity. To leverage the benefits of both 2DGS and 3DGS, we propose a novel method named MixedGaussianAvatar for realistically and geometrically accurate head avatar reconstruction. Our main idea is to utilize 2D Gaussians to reconstruct the surface of the 3D head, ensuring geometric accuracy. We attach the 2D Gaussians to the triangular mesh of the FLAME model and connect additional 3D Gaussians to those 2D Gaussians where the rendering quality of 2DGS is inadequate, creating a mixed 2D-3D Gaussian representation. These 2D-3D Gaussians can then be animated using FLAME parameters. We further introduce a progressive training strategy that first trains the 2D Gaussians and then fine-tunes the mixed 2D-3D Gaussians. We use a unified mixed Gaussian representation to integrate the two modalities of 2D image and 3D mesh. Furthermore, the comprehensive experiments demonstrate the superiority of MixedGaussianAvatar. The code will be released.
翻译:重建高保真度的3D头部化身在虚拟现实等应用中至关重要。开创性方法使用神经辐射场(NeRF)重建真实的头部化身,但受限于训练和渲染速度。基于3D高斯泼溅(3DGS)的最新方法显著提升了训练和渲染效率。然而,3DGS的表面不一致性导致几何精度不足;后续的2DGS使用2D面元提升几何精度,但牺牲了渲染保真度。为兼顾2DGS和3DGS的优势,我们提出了一种名为MixedGaussianAvatar的新方法,用于实现真实且几何精确的头部化身重建。核心思路是利用2D高斯重建3D头部表面以确保几何精度。我们将2D高斯附着于FLAME模型的三角网格,并在2DGS渲染质量不足的区域连接额外的3D高斯,形成混合2D-3D高斯表示。这些2D-3D高斯可通过FLAME参数进行动画驱动。我们进一步提出渐进式训练策略:先训练2D高斯,再微调混合2D-3D高斯。通过统一的混合高斯表示,我们整合了2D图像与3D网格两种模态。综合实验证明了MixedGaussianAvatar的优越性。代码将公开发布。