This paper presents an inverse kinematic optimization layer (IKOL) for 3D human pose and shape estimation that leverages the strength of both optimization- and regression-based methods within an end-to-end framework. IKOL involves a nonconvex optimization that establishes an implicit mapping from an image's 3D keypoints and body shapes to the relative body-part rotations. The 3D keypoints and the body shapes are the inputs and the relative body-part rotations are the solutions. However, this procedure is implicit and hard to make differentiable. So, to overcome this issue, we designed a Gauss-Newton differentiation (GN-Diff) procedure to differentiate IKOL. GN-Diff iteratively linearizes the nonconvex objective function to obtain Gauss-Newton directions with closed form solutions. Then, an automatic differentiation procedure is directly applied to generate a Jacobian matrix for end-to-end training. Notably, the GN-Diff procedure works fast because it does not rely on a time-consuming implicit differentiation procedure. The twist rotation and shape parameters are learned from the neural networks and, as a result, IKOL has a much lower computational overhead than most existing optimization-based methods. Additionally, compared to existing regression-based methods, IKOL provides a more accurate mesh-image correspondence. This is because it iteratively reduces the distance between the keypoints and also enhances the reliability of the pose structures. Extensive experiments demonstrate the superiority of our proposed framework over a wide range of 3D human pose and shape estimation methods.
翻译:本文为 3D 人的外观和形状提供了一个反动运动优化层( IKOL ), 以3D 人的外观和形状来进行估计, 从而在端对端框架内利用优化和回归法的强度。 IKOL 涉及非默认优化, 从图像的 3D 关键点和正文形状到相对的正文部分旋转, 暗含映射图。 3D 关键点和身体形状是输入, 相对的正文部分旋转是解决方案 。 但是, 这个程序是隐含的, 很难使差异化。 因此, 为了克服这个问题, 我们设计了一个高斯- 牛顿 偏差( GN- Diff) 程序来区分 IKO 的强度。 GN- Diff 包含一个非colvex 优化, 将非默认目标函数从 3D 的 3D 显示为默认方向, 以封闭式图解析定, 直接应用自动区分程序来生成叶花式矩阵 培训。 值得注意的是, GN- Diff 程序运行速度快速, 因为它不依赖于耗时的隐含的隐含差异程序 程序 。 。 这个旋转和形状参数参数的参数的参数的参数是比 更精确的 。 。 这个比较的更精确的缩缩算法 。