This paper focuses on the problem of 3D human reconstruction from 2D evidence. Although this is an inherently ambiguous problem, the majority of recent works avoid the uncertainty modeling and typically regress a single estimate for a given input. In contrast to that, in this work, we propose to embrace the reconstruction ambiguity and we recast the problem as learning a mapping from the input to a distribution of plausible 3D poses. Our approach is based on the normalizing flows model and offers a series of advantages. For conventional applications, where a single 3D estimate is required, our formulation allows for efficient mode computation. Using the mode leads to performance that is comparable with the state of the art among deterministic unimodal regression models. Simultaneously, since we have access to the likelihood of each sample, we demonstrate that our model is useful in a series of downstream tasks, where we leverage the probabilistic nature of the prediction as a tool for more accurate estimation. These tasks include reconstruction from multiple uncalibrated views, as well as human model fitting, where our model acts as a powerful image-based prior for mesh recovery. Our results validate the importance of probabilistic modeling, and indicate state-of-the-art performance across a variety of settings. Code and models are available at: https://www.seas.upenn.edu/~nkolot/projects/prohmr.
翻译:本文侧重于2D证据的3D人重建问题。 虽然这是一个内在的模糊问题,但大多数近期著作避免了不确定性的模型模型,通常会将某一投入的单一估计结果退缩。与此相反,我们提议接受重建的模糊性,并将问题重新表述为从输入到合理3D构成分布的图解。我们的方法是以正常流模式为基础,提供一系列优势。对于常规应用,需要单一3D估计,我们的配方允许高效计算模式。使用模式导致的绩效与确定性单式回归模型中的最新状态相当。同时,由于我们有机会获得每个样本的可能性,我们建议接受重建的模糊性,我们把问题重新表述为从输入到合理3D构成分布的图案。我们的方法是利用预测的概率性作为更准确估计的工具。这些任务包括从多种未经校正的观点进行重建,以及人类模型的完善,我们模型在 mesh恢复之前作为强有力的图像基础。我们的成果证实了概率模型/正态模型的重要性,同时,我们又确认了在各种样本/正式模型中展示了一种状态/ 。