In this work we develop a generalizable and efficient Neural Radiance Field (NeRF) pipeline for high-fidelity free-viewpoint human body synthesis under settings with sparse camera views. Though existing NeRF-based methods can synthesize rather realistic details for human body, they tend to produce poor results when the input has self-occlusion, especially for unseen humans under sparse views. Moreover, these methods often require a large number of sampling points for rendering, which leads to low efficiency and limits their real-world applicability. To address these challenges, we propose a Geometry-guided Progressive NeRF~(GP-NeRF). In particular, to better tackle self-occlusion, we devise a geometry-guided multi-view feature integration approach that utilizes the estimated geometry prior to integrate the incomplete information from input views and construct a complete geometry volume for the target human body. Meanwhile, for achieving higher rendering efficiency, we introduce a geometry-guided progressive rendering pipeline, which leverages the geometric feature volume and the predicted density values to progressively reduce the number of sampling points and speed up the rendering process. Experiments on the ZJU-MoCap and THUman datasets show that our method outperforms the state-of-the-arts significantly across multiple generalization settings, while the time cost is reduced >70% via applying our efficient progressive rendering pipeline.
翻译:在这项工作中,我们开发了一个可以普遍和高效的神经辐射场(NeRF)管道,用于在摄像器视野稀少的环境中进行高非性自由视觉人体合成。虽然现有的NeRF方法可以合成人体的相当现实的细节,但当输入具有自我封闭性时,这些方法往往会产生不良的结果,特别是对于在鲜见的人而言;此外,这些方法往往需要大量的取样点,从而导致效率低,并限制其真实世界的适用性。为了应对这些挑战,我们提议了一种由几何制制导的进步NERF~(GP-NERF)系统。特别是为了更好地解决自我封闭问题,我们设计了一种以几何制制制导的多视图特征集成法,在将不完整的信息从输入视图中综合起来之前使用估计的几何法,为目标人体构建完整的几何体体体积。与此同时,为了提高效率,我们引入了一种由几何制制制导导的渐进式输管线,利用了测地特征量和预测的密度值,以逐步减少取样点的数量,并加快转换过程。我们在多式模型模型中应用了我们通用的模型,同时,在普通的模型中进行实验了我们的模型中降低了成本。