The human hand is the main medium through which we interact with our surroundings, making its digitization an important problem. Hence, its digitization is of uttermost importance, with direct applications in VR/AR, gaming, and media production amongst other areas. While there are several works modeling the geometry of hands, little attention has been paid to capturing photo-realistic appearance. Moreover, for applications in extended reality and gaming, real-time rendering is critical. We present the first neural-implicit approach to photo-realistically render hands in real-time. This is a challenging problem as hands are textured and undergo strong articulations with pose-dependent effects. However, we show that this aim is achievable through our carefully designed method. This includes training on a low-resolution rendering of a neural radiance field, together with a 3D-consistent super-resolution module and mesh-guided sampling and space canonicalization. We demonstrate a novel application of perceptual loss on the image space, which is critical for learning details accurately. We also show a live demo where we photo-realistically render the human hand in real-time for the first time, while also modeling pose- and view-dependent appearance effects. We ablate all our design choices and show that they optimize for rendering speed and quality. Our code will be released to encourage further research in this area. The supplementary video can be found at: tinyurl.com/46uvujzn
翻译:人类手是我们与周围环境互动的主要媒介,其数字化是一个重要的问题。因此,其数字化是至关重要的,直接应用于VR / AR,游戏和媒体生产等领域。虽然有几项研究模拟手的几何形状,但对捕捉逼真外观的注意力很少。此外,对于扩展现实和游戏应用来说,实时渲染至关重要。我们提出了第一个通过神经隐式方法在实时中光真实地渲染手的方法。这是一个具有挑战性的问题,因为手部是有纹理的,并且在姿势相关效应下发生强烈的关节运动。但是,我们表明通过我们精心设计的方法可以实现这个目标。这包括对神经辐射场的低分辨率渲染进行训练,以及一种三维一致的超分辨率模块和基于网格的采样和空间规范化。我们展示了对图像空间的感知损失的新颖应用,这对于准确地学习细节至关重要。我们还展示了一个实时演示,在其中以首次实时方式逼真地呈现人类手,并模拟了姿势和视图相关的外观效果。我们分析了所有的设计选择,并表明它们优化了渲染速度和质量。我们的代码将被发布,以鼓励进一步的研究。说明视频可以在这里找到:tinyurl.com/46uvujzn