The rapid development of multi-view 3D human pose estimation (HPE) is attributed to the maturation of monocular 2D HPE and the geometry of 3D reconstruction. However, 2D detection outliers in occluded views due to neglect of view consistency, and 3D implausible poses due to lack of pose coherence, remain challenges. To solve this, we introduce a Multi-View Fusion module to refine 2D results by establishing view correlations. Then, Holistic Triangulation is proposed to infer the whole pose as an entirety, and anatomy prior is injected to maintain the pose coherence and improve the plausibility. Anatomy prior is extracted by PCA whose input is skeletal structure features, which can factor out global context and joint-by-joint relationship from abstract to concrete. Benefiting from the closed-form solution, the whole framework is trained end-to-end. Our method outperforms the state of the art in both precision and plausibility which is assessed by a new metric.
翻译:多视图 3D 人形估计(HPE)的快速发展是由于单眼 2D HPE 的成熟和3D 重建的几何学造成的。然而,由于对视觉一致性的忽视,在隐蔽观点中2D 检测外生值,以及由于对视觉一致性的忽视,3D 无法令人信服的外生值,仍然是挑战。为了解决这个问题,我们引入了一个多视图组合模块,通过建立视觉相关性来改进2D结果。然后,建议进行全方位三角测量,将整个外生值推入整体,并先注入解剖学,以保持外容一致性,提高外生性。之前解剖学由具有骨骼结构特征的五氯苯甲醚提取,其投入可以将全球环境和从抽象到混凝土的连成关系作为因素。从封闭式解决方案中受益,整个框架经过培训,最终到终端。我们的方法在精准性和合理性两方面都不符合由新指标评估的艺术状况。