Non-maximum suppression (NMS) is essential for state-of-the-art object detectors to localize object from a set of candidate locations. However, accurate candidate location sometimes is not associated with a high classification score, which leads to object localization failure during NMS. In this paper, we introduce a novel bounding box regression loss for learning bounding box transformation and localization variance together. The resulting localization variance exhibits a strong connection to localization accuracy, which is then utilized in our new non-maximum suppression method to improve localization accuracy for object detection. On MS-COCO, we boost the AP of VGG-16 faster R-CNN from 23.6% to 29.1% with a single model and nearly no additional computational overhead. More importantly, our method is able to improve the AP of ResNet-50 FPN fast R-CNN from 36.8% to 37.8%, which achieves state-of-the-art bounding box refinement result.
翻译:非最大抑制(NMS)对于从一组候选地点定位物体的最先进的物体探测器至关重要。 但是,准确的候选位置有时与高分类分数无关,这导致NMS期间目标定位失败。 在本文中,我们引入了一种新的捆绑框回归损失,以学习捆绑框变异和本地化差异。由此产生的本地化差异显示出与本地化精确度的强烈关联,然后在新的非最大抑制方法中使用该方法来提高物体探测的本地化精确度。 在MS-COCO上,我们用单一模型将VGG-16的AP从23.6%提高到29.1%,几乎没有额外的计算间接费用。更重要的是,我们的方法能够将ResNet-50 FPN 快速R-CNN的AP从36.8%提高到37.8%,从而实现最先进的捆绑框改进结果。