The human hand is the main medium through which we interact with our surroundings. Hence, its digitization is of uttermost importance, with direct applications in VR/AR, gaming, and media production amongst other areas. While there are several works for modeling the geometry and articulations of hands, little attention has been dedicated to capturing photo-realistic appearance. In addition, for applications in extended reality and gaming, real-time rendering is critical. In this work, we present the first neural-implicit approach to photo-realistically render hands in real-time. This is a challenging problem as hands are textured and undergo strong articulations with various pose-dependent effects. However, we show that this can be achieved through our carefully designed method. This includes training on a low-resolution rendering of a neural radiance field, together with a 3D-consistent super-resolution module and mesh-guided space canonicalization and sampling. In addition, we show the novel application of a perceptual loss on the image space is critical for achieving photorealism. We show rendering results for several identities, and demonstrate that our method captures pose- and view-dependent appearance effects. We also show a live demo of our method where we photo-realistically render the human hand in real-time for the first time in literature. We ablate all our design choices and show that our design optimizes for both photorealism and rendering speed. Our code will be released to encourage further research in this area.
翻译:人手是我们与周围环境互动的主要媒介。 因此, 人类手是我们与周围环境互动的主要媒介。 因此, 它的数字化是最重要的, 在 VR/ AR、 游戏和媒体制作等领域中直接应用。 虽然有几部关于模拟几何和手法的作品, 但很少注意捕捉摄影现实的外观。 此外, 对于在广泛现实和游戏中的应用, 实时显示至关重要。 在这项工作中, 我们展示了第一个对摄影现实做出选择的模糊的神经方法。 这是一个具有挑战性的问题, 因为手被发光, 并且有着各种依赖面貌的效果。 然而, 我们显示这可以通过我们精心设计的方法实现。 这包括低分辨率的拍摄光亮场培训, 以及一个3D相容的超分辨率模块和Mes- 制导的空间可感光化和取样。 此外, 我们展示了对图像空间的所有感知力损失的新应用, 对于实现摄影现实主义至关重要。 我们展示了一些真实的身份和真实的图像, 我们展示了我们真实的模型, 展示了一种真实的模型。