Human pose estimation (HPE) in 3D is an active research field that have many applications in entertainment, health and sport science, robotics. In the last five years markerless motion captures techniques have seen their average error decrease from more than 10cm to less than 2cm today. This evolution is mainly driven by the improvements in 2D pose estimation task that benefited from the use of convolutional networks. However with the multiplication of different approaches it can be difficult to identify what is more adapted to the specifics of any applications. We suggest to classify existing methods with a taxonomy based on the performance criteria of accuracy, speed and robustness. We review more than twenty methods from the last three years. Additionally we analyze the metrics, benchmarks and structure of the different pose estimation systems and propose several direction for future research. We hope to offer a good introduction to 3D markerless pose estimation as well as discussing the leading contemporary algorithms.
翻译:3D中的人造估计(HPE)是一个活跃的研究领域,在娱乐、健康和体育科学、机器人科学方面有许多应用。在过去5年中,无标记的运动捕捉技术的平均误差从10厘米以上下降到今天的2厘米以下。这一演变主要受2D的改进驱动,因为利用革命网络,人类的估算任务从革命网络中受益。但随着不同方法的倍增,很难确定哪些方法更适合任何应用的具体细节。我们建议根据精确、速度和稳健性的业绩标准对现有方法进行分类。我们审查了过去三年的20多种方法。此外,我们分析了不同型号估测系统的衡量标准、基准和结构,并为未来研究提出了若干方向。我们希望对3D无标记的构成估计进行良好的介绍,并讨论主要的当代算法。