Aiming at recognizing and localizing the object of novel categories by a few reference samples, few-shot object detection is a quite challenging task. Previous works often depend on the fine-tuning process to transfer their model to the novel category and rarely consider the defect of fine-tuning, resulting in many drawbacks. For example, these methods are far from satisfying in the low-shot or episode-based scenarios since the fine-tuning process in object detection requires much time and high-shot support data. To this end, this paper proposes a plug-and-play few-shot object detection (PnP-FSOD) framework that can accurately and directly detect the objects of novel categories without the fine-tuning process. To accomplish the objective, the PnP-FSOD framework contains two parallel techniques to address the core challenges in the few-shot learning, i.e., across-category task and few-annotation support. Concretely, we first propose two simple but effective meta strategies for the box classifier and RPN module to enable the across-category object detection without fine-tuning. Then, we introduce two explicit inferences into the localization process to reduce its dependence on the annotated data, including explicit localization score and semi-explicit box regression. In addition to the PnP-FSOD framework, we propose a novel one-step tuning method that can avoid the defects in fine-tuning. It is noteworthy that the proposed techniques and tuning method are based on the general object detector without other prior methods, so they are easily compatible with the existing FSOD methods. Extensive experiments show that the PnP-FSOD framework has achieved the state-of-the-art few-shot object detection performance without any tuning method. After applying the one-step tuning method, it further shows a significant lead in both efficiency, precision, and recall, under varied evaluation protocols.
翻译:为了通过几个参考样本来认识和本地化新类型对象,少发物体探测是一项相当艰巨的任务。 以往的工作往往取决于微调进程,以便将模型转移到新类别,而很少考虑微调的缺陷,从而造成许多缺点。 例如,这些方法远未在低发或小事件假设中得到满足,因为物体探测微调进程需要大量时间和高发支持数据。 为此,本文件提议了一个插接和播放少发物体探测(PnP-FSOD)框架,这个框架可以准确和直接地探测新类别对象,而不用微调进程。 为了实现目标, PnP-FSOD框架包含两种平行技术,以应对微发或小发的学习中的核心挑战,即跨类任务和少发点支持。 具体地说,我们首先为箱分类后和 RPNPN 模块提出两种简单有效的元战略,以便能够在不作微调的情况下更方便跨类物体探测。 然后, 我们向本地化过程中引入两个明确的目标, 将快速的物体探测器应用一种直径分析方法, 在本地的精确度测试框架中, 既能显示直观的精度分析方法。