We propose a novel sparse constrained formulation and from it derive a real-time optimization method for 3D human pose and shape estimation. Our optimization method is orders of magnitude faster (avg. 4 ms convergence) than existing optimization methods, while being mathematically equivalent to their dense unconstrained formulation. We achieve this by exploiting the underlying sparsity and constraints of our formulation to efficiently compute the Gauss-Newton direction. We show that this computation scales linearly with the number of joints of a complex 3D human model, in contrast to prior work where it scales cubically due to their dense unconstrained formulation. Based on our optimization method, we present a real-time motion capture framework that estimates 3D human poses and shapes from a single image at over 30 FPS. In benchmarks against state-of-the-art methods on multiple public datasets, our frame-work outperforms other optimization methods and achieves competitive accuracy against regression methods.
翻译:我们建议一种新颖的稀有限制配方,并从中得出一种用于3D人形和形状估计的实时优化方法。我们的优化方法比现有的优化方法更快数量级(avg.4ms 趋同),在数学上相当于其密集而不受限制的配方。我们通过利用我们配方的基本宽度和限制来有效地计算高斯-牛顿方向,实现这一点。我们用复杂的3D人模型的接合点数量来显示这种线性计算尺度,而以前的工作则由于这种3D型的配方密度高且不受限制而异。根据我们的优化方法,我们提出了一个实时动作捕捉取框架,根据30多个FPS的单一图像来估计3D人的容和形状。在参照多个公共数据集的最新方法时,我们的框架工作比其他优化方法要优于其他的精确度,并对照回归方法实现竞争性的精确性。