Monocular 3D human pose estimation from a single RGB image has received a lot attentions in the past few year. Pose inference models with competitive performance however require supervision with 3D pose ground truth data or at least known pose priors in their target domain. Yet, these data requirements in many real-world applications with data collection constraints may not be attainable. In this paper, we present a heuristic weakly supervised human pose (HW-HuP) solution to estimate 3D human pose in contexts that no ground truth 3D pose data is accessible, even for fine-tuning. HW-HuP learns partial pose priors from public 3D human pose datasets and uses easy-to-access observations from the target domain to iteratively estimate 3D human pose and shape in an optimization and regression hybrid cycle. In our design, depth data as an auxiliary information is employed as weak supervision during training, yet it is not needed for the inference. HW-HuP shows comparable performance on public benchmarks to the state-of-the-art approaches which benefit from full 3D pose supervision. In this paper, we focus on two practical applications of 3D pose estimation for individuals while in bed as well as infants, where no reliable 3D pose data exists.
翻译:在过去几年里,从一个 RGB 图像中得出的单体3D 人体表面估计得到了很多关注。 具有竞争性性能的假设模型需要3D 的监管,但3D 构成地面真相数据,或至少已知在目标领域具有先兆。 然而,许多具有数据收集限制的现实世界应用中的数据要求可能无法实现。 在本文中,我们提出了一个对3D 人类表面(HW-HuP)的估计方法:在无法获取地面真相3D 构成数据的情况下,即使为了微调,也无法获得这些数据。 HW-HuP 学习了公众3D 人体表面数据集的部分前缀,并使用目标领域易于获取的观测结果,迭接地估计3D 人类的外形和形状,形成优化和回归的混合周期。在我们的设计中,作为辅助信息的深度数据被作为培训过程中薄弱的监管手段使用,但并不需要用于推断。 HW-HuP 显示在公共基准上可与3D 完整监督的状态方法相比。 在本文中,我们侧重于3D 的两种实际应用数据,同时将3D 作为3D 的婴儿作为可靠的模型。