In keypoint estimation tasks such as human pose estimation, heatmap-based regression is the dominant approach despite possessing notable drawbacks: heatmaps intrinsically suffer from quantization error and require excessive computation to generate and post-process. Motivated to find a more efficient solution, we propose to model individual keypoints and sets of spatially related keypoints (i.e., poses) as objects within a dense single-stage anchor-based detection framework. Hence, we call our method KAPAO (pronounced "Ka-Pow"), for Keypoints And Poses As Objects. KAPAO is applied to the problem of single-stage multi-person human pose estimation by simultaneously detecting human pose and keypoint objects and fusing the detections to exploit the strengths of both object representations. In experiments, we observe that KAPAO is faster and more accurate than previous methods, which suffer greatly from heatmap post-processing. The accuracy-speed trade-off is especially favourable in the practical setting when not using test-time augmentation. Source code: https://github.com/wmcnally/kapao.
翻译:在关键估计任务中,如人造表面估计,热映射回归是主要的方法,尽管存在明显的缺点:热映射本身就存在量化错误,需要过量计算才能产生和完成过程。为了找到更有效率的解决办法,我们提议将单个关键点和一组空间相关关键点(即姿势)作为密集的单级锚基探测框架范围内的物体进行模拟。因此,我们将“KAPAO”(宣布的“Ka-Pow”)方法(宣布的“KA-Pow”)称为“KAPAO”方法(宣布的“KA-Pow”),用于关键点和“Poses as Oblictives” 。 KAPAO用于单阶段多人姿势估测问题,方法是同时探测人造和关键点对象,并用探测来利用这两个物体的优势。在实验中,我们观察到,“KAPAO”比以前的方法更快、更准确,后者因热映射后处理而深受影响。当不使用测试-时间增强时,精确交易在实际情况下特别有利。资料来源代码:http://githhub.com/wcally/kapaly/kapao。