Expensive bounding-box annotations have limited the development of object detection task. Thus, it is necessary to focus on more challenging task of few-shot object detection. It requires the detector to recognize objects of novel classes with only a few training samples. Nowadays, many existing popular methods based on meta-learning have achieved promising performance, such as Meta R-CNN series. However, only a single category of support data is used as the attention to guide the detecting of query images each time. Their relevance to each other remains unexploited. Moreover, a lot of recent works treat the support data and query images as independent branch without considering the relationship between them. To address this issue, we propose a dynamic relevance learning model, which utilizes the relationship between all support images and Region of Interest (RoI) on the query images to construct a dynamic graph convolutional network (GCN). By adjusting the prediction distribution of the base detector using the output of this GCN, the proposed model can guide the detector to improve the class representation implicitly. Comprehensive experiments have been conducted on Pascal VOC and MS-COCO dataset. The proposed model achieves the best overall performance, which shows its effectiveness of learning more generalized features. Our code is available at https://github.com/liuweijie19980216/DRL-for-FSOD.
翻译:因此,有必要侧重于更具有挑战性的微小物体探测任务,要求探测器识别新课程对象,只有少量培训样本。如今,许多基于元学习的现有流行方法,如Meta R-CNN系列,都取得了有希望的性能。然而,只使用单一类别的支持数据来指导每次探测查询图像的注意,这些数据对彼此的相关性仍未得到利用。此外,许多近期工作将支持数据和查询图像作为独立分支处理,而不考虑它们之间的关系。为了解决这个问题,我们提出了一个动态相关性学习模型,利用所有支持图像与查询图像中感兴趣的区域(ROI)之间的关系,以构建动态图象革命网络(GCN)。通过使用GCN输出调整基础探测器的预测分布,拟议的模型可以指导探测器暗中改善课堂代表性。在Pascal VOC和MS-COCO数据集方面进行了全面实验。拟议模型将实现最佳的总体性性能,显示其可普及性能。