In this paper, we study object detection using a large pool of unlabeled images and only a few labeled images per category, named "few-example object detection". The key challenge consists in generating trustworthy training samples as many as possible from the pool. Using few training examples as seeds, our method iterates between model training and high-confidence sample selection. In training, easy samples are generated first and, then the poorly initialized model undergoes improvement. As the model becomes more discriminative, challenging but reliable samples are selected. After that, another round of model improvement takes place. To further improve the precision and recall of the generated training samples, we embed multiple detection models in our framework, which has proven to outperform the single model baseline and the model ensemble method. Experiments on PASCAL VOC'07, MS COCO'14, and ILSVRC'13 indicate that by using as few as three or four samples selected for each category, our method produces very competitive results when compared to the state-of-the-art weakly-supervised approaches using a large number of image-level labels.
翻译:在本文中,我们使用大量未贴标签的图像来研究物体探测,每个类别只有几张标签的图像,称为“few-example objects control” 。关键的挑战在于尽可能多地从池中生成可靠的培训样本。使用很少的培训实例作为种子,我们的方法在示范培训与高自信抽样选择之间相互交叉。在培训中,先生成容易的样本,然后不易初始化的模式就会得到改进。随着模型变得更有歧视,具有挑战性,但可靠的样本被选择。之后,又进行另一轮模型改进。为了进一步提高所生成的培训样本的精确度和回顾性,我们将多种检测模型嵌入我们的框架中,这已经证明超越了单一模型基线和模型组装方法。在PASAL VOC'07、MS COCO'14和 ILSVRC'13上进行的实验表明,与使用大量图像等级标签的、最先进的弱监督方法相比,我们的方法产生非常有竞争力的结果。