Marker-less monocular 3D human motion capture (MoCap) with scene interactions is a challenging research topic relevant for extended reality, robotics and virtual avatar generation. Due to the inherent depth ambiguity of monocular settings, 3D motions captured with existing methods often contain severe artefacts such as incorrect body-scene inter-penetrations, jitter and body floating. To tackle these issues, we propose HULC, a new approach for 3D human MoCap which is aware of the scene geometry. HULC estimates 3D poses and dense body-environment surface contacts for improved 3D localisations, as well as the absolute scale of the subject. Furthermore, we introduce a 3D pose trajectory optimisation based on a novel pose manifold sampling that resolves erroneous body-environment inter-penetrations. Although the proposed method requires less structured inputs compared to existing scene-aware monocular MoCap algorithms, it produces more physically-plausible poses: HULC significantly and consistently outperforms the existing approaches in various experiments and on different metrics. Project page: https://vcai.mpi-inf.mpg.de/projects/HULC/.
翻译:3D人类运动捕获(MoCap)与现场互动是一个具有挑战性的研究课题,与扩大的现实、机器人和虚拟活体生成相关。由于单子环境固有的深度模糊性,以现有方法捕捉的3D运动往往包含不正确的身体-皮肤间穿透、颤动和漂浮等严重人工制品。为了解决这些问题,我们提议HULC,3D人运动捕获的新办法,即了解现场几何的3D人运动。HULC估计3D构成和密集的身体-环境表面接触,以改善3D本地化,以及该主题的绝对规模。此外,我们采用3D构成轨迹优化,基于新式的多重取样,解决了错误的身体-环境间穿透。虽然拟议方法需要结构化较少的投入,而与现有的景色单体摩卡算法相比,但它产生更有形的成型:HULC明显和一贯地超越了各种实验和不同指标中的现有方法。项目页面: https://vcai.mpi-inf.mpe/ximus。