Estimating hand-object (HO) pose during interaction has been brought remarkable growth in virtue of deep learning methods. Modeling the contact between the hand and object properly is the key to construct a plausible grasp. Yet, previous works usually focus on jointly estimating HO pose but not fully explore the physical contact preserved in grasping. In this paper, we present an explicit contact representation, Contact Potential Field (CPF) that models each hand-object contact as a spring-mass system. Then we can refine a natural grasp by minimizing the elastic energy w.r.t those systems. To recover CPF, we also propose a learning-fitting hybrid framework named MIHO. Extensive experiments on two public benchmarks have shown that our method can achieve state-of-the-art in several reconstruction metrics, and allow us to produce more physically plausible HO pose even when the ground-truth exhibits severe interpenetration or disjointedness. Our code is available at https://github.com/lixiny/CPF .
翻译:通过深层次的学习方法,在互动过程中对手粒子(HO)的外形进行估计已经带来了显著的增长。对手粒子和物体之间的接触进行适当建模是构建一个合理理解的关键。然而,以往的工作通常侧重于共同估计HO的外形,但并不完全探索在捕捉中保存的物理接触。在本文中,我们提出了一个明确的接触代表,即接触潜力领域,将每个手粒子的接触作为弹簧质系统进行模型。然后,我们就可以通过将弹性能量最小化来改进自然捕捉。为了恢复CFP,我们还提议了一个适合学习的混合框架,名为MIHO。关于两个公共基准的广泛实验表明,我们的方法可以在几个重建指标中达到最新水平,并使我们能够产生更符合实际的HOM的外形。我们的代码可以在https://github.com/lixiny/CPF中查阅。