We observe that human poses exhibit strong group-wise structural correlation and spatial coupling between keypoints due to the biological constraints of different body parts. This group-wise structural correlation can be explored to improve the accuracy and robustness of human pose estimation. In this work, we develop a self-constrained prediction-verification network to characterize and learn the structural correlation between keypoints during training. During the inference stage, the feedback information from the verification network allows us to perform further optimization of pose prediction, which significantly improves the performance of human pose estimation. Specifically, we partition the keypoints into groups according to the biological structure of human body. Within each group, the keypoints are further partitioned into two subsets, high-confidence base keypoints and low-confidence terminal keypoints. We develop a self-constrained prediction-verification network to perform forward and backward predictions between these keypoint subsets. One fundamental challenge in pose estimation, as well as in generic prediction tasks, is that there is no mechanism for us to verify if the obtained pose estimation or prediction results are accurate or not, since the ground truth is not available. Once successfully learned, the verification network serves as an accuracy verification module for the forward pose prediction. During the inference stage, it can be used to guide the local optimization of the pose estimation results of low-confidence keypoints with the self-constrained loss on high-confidence keypoints as the objective function. Our extensive experimental results on benchmark MS COCO and CrowdPose datasets demonstrate that the proposed method can significantly improve the pose estimation results.
翻译:我们观察到,由于身体不同部分的生物制约,人体构成的临界点在群体上表现出强大的结构相关性和空间连接。可以探索这一群体结构相关性,以提高人体构成估计的准确性和稳健性。在这项工作中,我们开发了一个自我约束的预测核查网络,以描述和学习培训中关键点之间的结构性相关性。在推论阶段,核查网络的反馈信息使我们能够进一步优化构成预测,从而大大改善人体构成估计的性能。具体地说,我们根据人体的生物结构将关键点分成若干组。在每一组中,关键点被进一步分成两个组,即高度信心基点和低信心终端关键点。我们开发了一个自我约束的预测核查网络,以便在这些关键点子之间进行前向和后向预测。在作出预测时,一个根本性的挑战是,我们没有机制来核查获得的估算或预测结果是否准确或没有准确,因为地面的真相是不存在的。在先期的阶段,先期的预测中,先期的精确性结果将显示为先期的准确性。在先期的预测中,先期的准确性测试中,先期的精确度将先期数据作为先期估算。