Physical contact provides additional constraints for hand-object state reconstruction as well as a basis for further understanding of interaction affordances. Estimating these severely occluded regions from monocular images presents a considerable challenge. Existing methods optimize the hand-object contact driven by distance threshold or prior from contact-labeled datasets. However, due to the number of subjects and objects involved in these indoor datasets being limited, the learned contact patterns could not be generalized easily. Our key idea is to reconstruct the contact pattern directly from monocular images, and then utilize the physical stability criterion in the simulation to optimize it. This criterion is defined by the resultant forces and contact distribution computed by the physics engine.Compared to existing solutions, our framework can be adapted to more personalized hands and diverse object shapes. Furthermore, an interaction dataset with extra physical attributes is created to verify the sim-to-real consistency of our methods. Through comprehensive evaluations, hand-object contact can be reconstructed with both accuracy and stability by the proposed framework.
翻译:物理接触为亲身物体国家重建提供了额外的限制,也为进一步理解互动提供基础。用单眼图像来估计这些严重隐蔽的区域带来了相当大的挑战。现有方法优化由距离阈值驱动的亲身物体接触,或先从接触标签数据集驱动的亲身物体接触。然而,由于这些室内数据集所涉及的主题和对象数量有限,所学的接触模式无法轻易地普及。我们的关键想法是从单眼图像直接重建接触模式,然后在模拟中使用物理稳定性标准来优化它。这一标准由物理引擎计算的结果力量和接触分布来界定。与现有解决方案相比,我们的框架可以适应更个性化的手和不同对象形状。此外,还创建了带有额外物理属性的互动数据集,以核实我们方法的模子到真实一致性。通过全面评估,手脚接触可以通过拟议框架的准确性和稳定性来重建。