In this paper, we present a novel algorithm to extract a quaternion from a two dimensional camera frame for estimating a contained human skeletal pose. The problem of pose estimation is usually tackled through the usage of stereo cameras and intertial measurement units for obtaining depth and euclidean distance for measurement of points in 3D space. However, the usage of these devices comes with a high signal processing latency as well as a significant monetary cost. By making use of MediaPipe, a framework for building perception pipelines for human pose estimation, the proposed algorithm extracts a quaternion from a 2-D frame capturing an image of a human object at a sub-fifty millisecond latency while also being capable of deployment at edges with a single camera frame and a generally low computational resource availability, especially for use cases involving last-minute detection and reaction by autonomous robots. The algorithm seeks to bypass the funding barrier and improve accessibility for robotics researchers involved in designing control systems.
翻译:在本文中,我们提出了一个新颖的算法,从两维摄像框中提取四分之四,用于估计包含的人体骨骼结构; 表面估计问题通常通过使用立体照相机和中间测量器来解决,以获得深度和优clidean距离测量3D空间的点数; 然而,这些装置的使用伴随着高信号处理延迟度以及巨大的货币成本。 通过利用MediaPipe,即构建人类表面估计感知管道的框架,提议的算法从二维框中提取一个四分之一,从二维框中捕捉到一个半毫秒悬浮度以下的人类物体的图像,同时能够在边缘部署一个单一摄像头和一般较低的计算资源可用性,特别是用于涉及自主机器人最后一分钟的检测和反应的案例。 算法试图绕过资金障碍,改善参与设计控制系统的机器人研究人员的无障碍环境。</s>