Human pose estimation aims to locate the human body parts and build human body representation (e.g., body skeleton) from input data such as images and videos. It has drawn increasing attention during the past decade and has been utilized in a wide range of applications including human-computer interaction, motion analysis, augmented reality, and virtual reality. Although the recently developed deep learning-based solutions have achieved high performance in human pose estimation, there still remain challenges due to insufficient training data, depth ambiguities, and occlusion. The goal of this survey paper is to provide a comprehensive review of recent deep learning-based solutions for both 2D and 3D pose estimation via a systematic analysis and comparison of these solutions based on their input data and inference procedures. More than 250 research papers since 2014 are covered in this survey. Furthermore, 2D and 3D human pose estimation datasets and evaluation metrics are included. Quantitative performance comparisons of the reviewed methods on popular datasets are summarized and discussed. Finally, the challenges involved, applications, and future research directions are concluded. A regularly updated project page is provided: \url{https://github.com/zczcwh/DL-HPE}
翻译:虽然最近开发的深层学习基础解决方案在人体构成方面表现良好,但由于培训数据不足、深度模糊不清和封闭性,仍然存在着挑战。本调查文件的目标是通过系统分析和比较这些解决方案,在图像和视频等投入数据的基础上,对2D和3D近期的深层学习解决方案进行估算。自2014年以来,本调查覆盖了250多份研究论文。此外,还包含2D和3D人构成估计数据集和评价指标。对已审查的大众数据集方法的定量绩效比较进行了总结和讨论。最后,完成了所涉及的挑战、应用和未来研究方向。定期更新的项目网页:https://github.com/HP-Ezzwh: