The results obtained from state of the art human pose estimation (HPE) models degrade rapidly when evaluating people of a low resolution, but can super resolution (SR) be used to help mitigate this effect? By using various SR approaches we enhanced two low resolution datasets and evaluated the change in performance of both an object and keypoint detector as well as end-to-end HPE results. We remark the following observations. First we find that for people who were originally depicted at a low resolution (segmentation area in pixels), their keypoint detection performance would improve once SR was applied. Second, the keypoint detection performance gained is dependent on that persons pixel count in the original image prior to any application of SR; keypoint detection performance was improved when SR was applied to people with a small initial segmentation area, but degrades as this becomes larger. To address this we introduced a novel Mask-RCNN approach, utilising a segmentation area threshold to decide when to use SR during the keypoint detection step. This approach achieved the best results on our low resolution datasets for each HPE performance metrics.
翻译:使用各种甚高分辨率方法,我们提升了两个低分辨率数据集,并评估了对象和关键点检测器的性能变化,以及最终至终端的HPE结果。我们提到以下意见。首先,我们发现,对于最初被描绘为低分辨率(像素中的分解区域)的人,一旦应用了SR,其关键点检测性能就会提高。第二,关键点检测性能取决于在应用SR之前在原始图像中计数的人的像素;当对初始分解区块小的人应用SR时,关键点检测性能得到了改进,但随着其规模的扩大而降低。我们为此采用了一种新型的Mask-RCNN方法,在关键点检测步骤中利用分解区域阈值来决定何时使用SR。这一方法在每项HPE绩效指标的低分辨率数据集上取得了最佳结果。