This paper presents a novel alternative to Greedy Non-Maxima Suppression (NMS) in the task of bounding box selection and suppression in object detection. It proposes Confluence, an algorithm which does not rely solely on individual confidence scores to select optimal bounding boxes, nor does it rely on Intersection Over Union (IoU) to remove false positives. Using Manhattan Distance, it selects the bounding box which is closest to every other bounding box within the cluster and removes highly confluent neighboring boxes. Thus, Confluence represents a paradigm shift in bounding box selection and suppression as it is based on fundamentally different theoretical principles to Greedy NMS and its variants. Confluence is experimentally validated on RetinaNet, YOLOv3 and Mask-RCNN, using both the MS COCO and PASCAL VOC 2007 datasets. Confluence outperforms Greedy NMS in both mAP and recall on both datasets, using the challenging 0.50:0.95 mAP evaluation metric. On each detector and dataset, mAP was improved by 0.3-0.7% while recall was improved by 1.4-2.5%. A theoretical comparison of Greedy NMS and the Confluence Algorithm is provided, and quantitative results are supported by extensive qualitative results analysis. Furthermore, sensitivity analysis experiments across mAP thresholds support the conclusion that Confluence is more robust than NMS.
翻译:本文介绍了在物体检测中约束框的选择和抑制任务中,贪婪非麦西马非抑制(NMS)是一种创新的替代方法。 它提出了集成, 这种算法并不完全依靠个人信任评分来选择最佳的捆绑框, 也不依靠交叉重叠联盟(IoU)来消除假阳性。 它使用曼哈顿距离, 选择了最接近集群内所有其他约束框的捆绑框, 并删除了高度松散的邻接框。 因此, 影响代表了约束框选择和抑制的范式转变, 因为它基于对贪心NMS及其变异的完全不同的理论原则。 影响是用雷蒂纳Net、 YOLOv3 和 Mask-RCNNN 进行实验性的验证, 使用 MS COCOCO 和 PCAL VOC 2007 数据集 进行实验。 影响外延缩放Greedy NMS 和回顾这两个数据集, 具有挑战性的 0.50. 0.95 mAP 评估指标。 关于每个探测器和数据设置, mAP 改进了0.3-0.7% 和Gremigration 分析结果。