The evaluation of 3D face reconstruction results typically relies on a rigid shape alignment between the estimated 3D model and the ground-truth scan. We observe that aligning two shapes with different reference points can largely affect the evaluation results. This poses difficulties for precisely diagnosing and improving a 3D face reconstruction method. In this paper, we propose a novel evaluation approach with a new benchmark REALY, consists of 100 globally aligned face scans with accurate facial keypoints, high-quality region masks, and topology-consistent meshes. Our approach performs region-wise shape alignment and leads to more accurate, bidirectional correspondences during computing the shape errors. The fine-grained, region-wise evaluation results provide us detailed understandings about the performance of state-of-the-art 3D face reconstruction methods. For example, our experiments on single-image based reconstruction methods reveal that DECA performs the best on nose regions, while GANFit performs better on cheek regions. Besides, a new and high-quality 3DMM basis, HIFI3D++, is further derived using the same procedure as we construct REALY to align and retopologize several 3D face datasets. We will release REALY, HIFI3D++, and our new evaluation pipeline at https://realy3dface.com.
翻译:3D的重建成果评估通常取决于估计的 3D 模型和地面真相扫描之间的僵硬形状组合。我们观察到,将两种形状与不同的参考点相匹配,可能会在很大程度上影响评价结果。这给准确诊断和改进3D的重建方法带来了困难。在本文件中,我们提出一个新的基准“RealY”的新评价方法,由100个全球一致的面部扫描组成,具有准确的面部关键点、高质量的区域面具和符合地形特征的 meshes。此外,我们的方法在计算形状错误时,对区域进行明智的形状调整,导致更准确的双向对应。精细精细的、对区域的评价结果为我们详细了解了3D面临的重建方法。例如,我们基于重建方法的单一图像实验显示,DECA在鼻子区域表现最佳,而GANFitit在脸部区域表现更好。此外,在计算形状错误时,HIF3 3 将进一步用同样程序推算出我们MEREY+D3的版本, ASDADR3 将调整和RDADAD 3 新的数据升级。