In recent years, object detection has experienced impressive progress. Despite these improvements, there is still a significant gap in the performance between the detection of small and large objects. We analyze the current state-of-the-art model, Mask-RCNN, on a challenging dataset, MS COCO. We show that the overlap between small ground-truth objects and the predicted anchors is much lower than the expected IoU threshold. We conjecture this is due to two factors; (1) only a few images are containing small objects, and (2) small objects do not appear enough even within each image containing them. We thus propose to oversample those images with small objects and augment each of those images by copy-pasting small objects many times. It allows us to trade off the quality of the detector on large objects with that on small objects. We evaluate different pasting augmentation strategies, and ultimately, we achieve 9.7\% relative improvement on the instance segmentation and 7.1\% on the object detection of small objects, compared to the current state of the art method on MS COCO.
翻译:近年来,物体探测工作取得了令人瞩目的进展。尽管取得了这些改进,但发现大小物体的性能之间仍然存在着巨大差距。我们在具有挑战性的数据集MS COCO上分析了目前最先进的模型Mask-RCNN。我们表明,小型地面实况天体和预测锚的重叠比预期的IOU临界值要低得多。我们推测这有两个原因:(1)只有少量图像含有小物体,(2)小物体在每张含有这些物体的图像中都看不到足够多的。我们因此建议用复制式小物体来过多地标出这些小物体的图像,并用复制式的小型物体来放大每幅图像。它使我们能够将大型物体的探测器质量与小物体的大型物体进行交换。我们评估了不同的粘附增强战略,最终,我们比MS CO CO 目前的艺术方法,在小物体的物体探测方面实现了9.7 ⁇ 相对改进。