Human pose estimation aims to locate the human body parts and build human body representation (e.g., body skeleton) from input data such as images and videos. It has drawn increasing attention during the past decade and has been utilized in a wide range of applications including human-computer interaction, motion analysis, augmented reality, and virtual reality. Although the recently developed deep learning-based solutions have achieved high performance in human pose estimation, there still remain challenges due to insufficient training data, depth ambiguities, and occlusions. The goal of this survey paper is to provide a comprehensive review of recent deep learning-based solutions for both 2D and 3D pose estimation via a systematic analysis and comparison of these solutions based on their input data and inference procedures. More than 240 research papers since 2014 are covered in this survey. Furthermore, 2D and 3D human pose estimation datasets and evaluation metrics are included. Quantitative performance comparisons of the reviewed methods on popular datasets are summarized and discussed. Finally, the challenges involved, applications, and future research directions are concluded. We also provide a regularly updated project page on: \url{https://github.com/zczcwh/DL-HPE}
翻译:虽然最近开发的深层学习基础解决方案在人体构成方面表现良好,但由于培训数据不足、深度含糊不清和排斥等原因,仍然存在着挑战。本调查文件的目标是通过系统分析和比较这些解决方案,在图像和视频等投入数据的基础上对2D和3D的估算进行系统分析和比较,从而对这些解决方案的估算进行全面审查。自2014年以来,本调查覆盖了240多份研究论文。此外,还包含2D和3D的人体构成估计数据集和评价指标。对已审查的大众数据集方法的定量绩效比较进行了总结和讨论。最后,还完成了所涉及的挑战、应用和未来研究方向。我们还定期更新了项目页面:url{https://github.com/Chzw}:url{D://Github.c/Ezzw}。