We present HARP (HAnd Reconstruction and Personalization), a personalized hand avatar creation approach that takes a short monocular RGB video of a human hand as input and reconstructs a faithful hand avatar exhibiting a high-fidelity appearance and geometry. In contrast to the major trend of neural implicit representations, HARP models a hand with a mesh-based parametric hand model, a vertex displacement map, a normal map, and an albedo without any neural components. As validated by our experiments, the explicit nature of our representation enables a truly scalable, robust, and efficient approach to hand avatar creation. HARP is optimized via gradient descent from a short sequence captured by a hand-held mobile phone and can be directly used in AR/VR applications with real-time rendering capability. To enable this, we carefully design and implement a shadow-aware differentiable rendering scheme that is robust to high degree articulations and self-shadowing regularly present in hand motion sequences, as well as challenging lighting conditions. It also generalizes to unseen poses and novel viewpoints, producing photo-realistic renderings of hand animations performing highly-articulated motions. Furthermore, the learned HARP representation can be used for improving 3D hand pose estimation quality in challenging viewpoints. The key advantages of HARP are validated by the in-depth analyses on appearance reconstruction, novel-view and novel pose synthesis, and 3D hand pose refinement. It is an AR/VR-ready personalized hand representation that shows superior fidelity and scalability.
翻译:我们展示了HARP(HAND)(HAND 重建和个性化),一种个性化的手动动动动器创建方法,它以人的手作为输入,用一个短的单镜 RGB 视频,并重建了一个忠实的手动动动动器,展示出高不忠的外观和几何特征。与神经隐含表达的主要趋势相反,HARP用一个以网状为基础的模拟手模模型、一个顶部转移图、一个正常的地图和一个没有神经元构件的反光仪来模拟手动的反光仪。正如我们实验所证实的那样,我们的代表面的清晰性能使得能够真正具有伸缩性、强、高效的反射手动器创建。HARP通过由手持个人手机拍摄的短顺序从梯度下下降,并直接用于具有实时显示能力的AR/VR应用程序应用中。我们仔细设计和实施一个有阴影的、有差异的投影化的投影制方案,它定期出现在手动的顺序中,以及具有挑战性的照明条件。它也可以概括地概括地和新式的直观观点,通过手动的变的图像的图像的更精确的变现,在不断的图像上展示中展示中展示的图像的变现的图像的图像的图像的自我演化分析中,可以展示的自我演进式的自我演进式的自我演进式的优势。