泛化神经人体拟合到未见过的姿势与关节SE(3)等变性 (Generalizing Neural Human Fitting to Unseen Poses With Articulated SE(3) Equivariance)

We address the problem of fitting a parametric human body model (SMPL) to point cloud data. Optimization-based methods require careful initialization and are prone to becoming trapped in local optima. Learning-based methods address this but do not generalize well when the input pose is far from those seen during training. For rigid point clouds, remarkable generalization has been achieved by leveraging SE(3)-equivariant networks, but these methods do not work on articulated objects. In this work we extend this idea to human bodies and propose ArtEq, a novel part-based SE(3)-equivariant neural architecture for SMPL model estimation from point clouds. Specifically, we learn a part detection network by leveraging local SO(3) invariance, and regress shape and pose using articulated SE(3) shape-invariant and pose-equivariant networks, all trained end-to-end. Our novel equivariant pose regression module leverages the permutation-equivariant property of self-attention layers to preserve rotational equivariance. Experimental results show that ArtEq can generalize to poses not seen during training, outperforming state-of-the-art methods by 74.5%, without requiring an optimization refinement step. Further, compared with competing works, our method is more than three orders of magnitude faster during inference and has 97.3% fewer parameters. The code and model will be available for research purposes at https://arteq.is.tue.mpg.de.

翻译：我们解决了将参数人体模型(SMPL)拟合到点云数据的问题。基于优化的方法需要仔细初始化和容易陷入局部最优解。学习基础的方法解决了这个问题，但是在输入姿势与训练时差距较大时，这些方法很难泛化。对于刚性点云，利用SE(3)等变网络已经取得了非凡的泛化效果，但这些方法不能用于关节物体。在这项工作中，我们将这个想法扩展到人体，并提出了一种新的基于部分的SE(3)等变神经体系结构ArtEq，用于从点云中估计SMPL模型。具体来说，我们通过利用本地SO(3)不变性学习部分检测网络，并使用关节SE(3)形状不变和姿态等变网络进行形状和姿态回归，全部进行端到端学习。我们的新型等变姿势回归模块利用了自我注意层的置换等变性质来保持旋转等变性。实验结果表明，ArtEq能够泛化到训练时未见过的姿势，超过了最先进的方法74.5％，而不需要优化细化步骤。此外，与竞争方法相比，我们的方法在推理期间更快，参数少97.3％以上。代码和模型可用于 https://arteq.is.tue.mpg.de 进行研究目的。