For nearly a decade, the COCO dataset has been the central test bed of research in object detection. According to the recent benchmarks, however, it seems that performance on this dataset has started to saturate. One possible reason can be that perhaps it is not large enough for training deep models. To address this limitation, here we introduce two complementary datasets to COCO: i) COCO_OI, composed of images from COCO and OpenImages (from their 80 classes in common) with 1,418,978 training bounding boxes over 380,111 images, and 41,893 validation bounding boxes over 18,299 images, and ii) ObjectNet_D containing objects in daily life situations (originally created for object recognition known as ObjectNet; 29 categories in common with COCO). The latter can be used to test the generalization ability of object detectors. We evaluate some models on these datasets and pinpoint the source of errors. We encourage the community to utilize these datasets for training and testing object detection models. Code and data is available at https://github.com/aliborji/COCO_OI.
翻译:近十年来,COCO数据集一直是物体探测研究的中心测试台,但根据最近的基准,该数据集的性能似乎已开始饱和。一个可能的原因可能是该数据集的性能可能不足以培训深层模型。为解决这一局限性,我们在此向COCO引入两个补充数据集:i)COCO_OI,由COCO和OpenImaage(共80个类别)的图像组成,共有1,418,978个训练框,超过380,111个图像,41,893个验证框,超过18,299个图像,以及ii)OcalNet_D,包含日常生活中的物体(最初为确认物体而创建的物体名称为Ocornet,29个类别与COCO相同),后者可用于测试物体探测器的一般化能力。我们对这些数据集和OpenI的一些模型进行评估,并确定出误差源。我们鼓励社区利用这些数据集来培训和测试物体探测模型。代码和数据见https://github.com/alibji/CO_OI。