This paper presents a novel unsupervised approach to reconstruct human shape and pose from noisy point cloud. Traditional approaches search for correspondences and conduct model fitting iteratively where a good initialization is critical. Relying on large amount of dataset with ground-truth annotations, recent learning-based approaches predict correspondences for every vertice on the point cloud; Chamfer distance is usually used to minimize the distance between a deformed template model and the input point cloud. However, Chamfer distance is quite sensitive to noise and outliers, thus could be unreliable to assign correspondences. To address these issues, we model the probability distribution of the input point cloud as generated from a parametric human model under a Gaussian Mixture Model. Instead of explicitly aligning correspondences, we treat the process of correspondence search as an implicit probabilistic association by updating the posterior probability of the template model given the input. A novel unsupervised loss is further derived that penalizes the discrepancy between the deformed template and the input point cloud conditioned on the posterior probability. Our approach is very flexible, which works with both complete point cloud and incomplete ones including even a single depth image as input. Our network is trained from scratch with no need to warm-up the network with supervised data. Compared to previous unsupervised methods, our method shows the capability to deal with substantial noise and outliers. Extensive experiments conducted on various public synthetic datasets as well as a very noisy real dataset (i.e. CMU Panoptic) demonstrate the superior performance of our approach over the state-of-the-art methods. Code can be found \url{https://github.com/wangsen1312/unsupervised3dhuman.git}
翻译:本文展示了重建人类形状和从噪音点云中造型的新颖且不受监督的方法 。 传统的方法是寻找通信和行为模型, 在良好的初始化非常关键的地方迭接。 依靠大量带有地面真相说明的数据集, 最近的学习基础方法预测了点云中每个脊椎的对应; Champer 距离通常用于将变形模板模型和输入点云之间的距离最小化 。 然而, Champer 距离对于噪音和外部线非常敏感, 因此可能不可靠, 指派通信。 为了解决这些问题, 我们用高尔斯混合模型中模拟人类模型生成的输入点云的概率分布模型进行模拟。 我们将通信搜索过程视为隐含的概率关联, 更新模板模型的外观概率模型模型和输入点之间的距离 。 一种新的、 未经监督的损失进一步推算出变异的模板和输入点的云点之间的差异, 因此, 我们的方法非常灵活, 使用完全的超标的超值云和不完全的人类模型, 包括经过监督的单个图像输入方法。