Few-shot object detection aims to detect instances of specific categories in a query image with only a handful of support samples. Although this takes less effort than obtaining enough annotated images for supervised object detection, it results in a far inferior performance compared to the conventional object detection methods. In this paper, we propose a meta-learning-based approach that considers the unique characteristics of each support sample. Rather than simply averaging the information of the support samples to generate a single prototype per category, our method can better utilize the information of each support sample by treating each support sample as an individual prototype. Specifically, we introduce two types of attention mechanisms for aggregating the query and support feature maps. The first is to refine the information of few-shot samples by extracting shared information between the support samples through attention. Second, each support sample is used as a class code to leverage the information by comparing similarities between each support feature and query features. Our proposed method is complementary to the previous methods, making it easy to plug and play for further improvement. We have evaluated our method on PASCAL VOC and COCO benchmarks, and the results verify the effectiveness of our method. In particular, the advantages of our method are maximized when there is more diversity among support data.
翻译:少量物体探测的目的是在查询图像中检测特定类别的情况,只有少量辅助样本。虽然这比获得足够的附加说明的图像以用于监视物体探测要少得多,但与常规物体探测方法相比,其性能远远不如常规物体探测方法差。在本文件中,我们提出一个基于元学习的方法,考虑到每个辅助样本的独特性。我们的方法不是简单地将支持样本的信息平均化,以生成每个类别的单一原型,而是将每个支持样本作为单个原型处理,从而更好地利用每个支持样本的信息。具体地说,我们采用两种关注机制来汇总查询和支持特征地图。首先,通过通过关注提取支持样本之间的共享信息来改进少数样本的信息。第二,每个支持样本都用作类代码,通过比较每一种支持特征和查询特征之间的相似性来利用信息。我们建议的方法是对以往方法的补充,便于插入和播放,以便进一步改进。我们评估了我们关于PASAL VOC和COCOCO基准的方法,并核实了我们方法的有效性。特别是,我们方法的优势是最大化的,因为数据具有更大的多样性。