Open world object detection aims at detecting objects that are absent in the object classes of the training data as unknown objects without explicit supervision. Furthermore, the exact classes of the unknown objects must be identified without catastrophic forgetting of the previous known classes when the corresponding annotations of unknown objects are given incrementally. In this paper, we propose a two-stage training approach named Open World DETR for open world object detection based on Deformable DETR. In the first stage, we pre-train a model on the current annotated data to detect objects from the current known classes, and concurrently train an additional binary classifier to classify predictions into foreground or background classes. This helps the model to build an unbiased feature representations that can facilitate the detection of unknown classes in subsequent process. In the second stage, we fine-tune the class-specific components of the model with a multi-view self-labeling strategy and a consistency constraint. Furthermore, we alleviate catastrophic forgetting when the annotations of the unknown classes becomes available incrementally by using knowledge distillation and exemplar replay. Experimental results on PASCAL VOC and MS-COCO show that our proposed method outperforms other state-of-the-art open world object detection methods by a large margin.
翻译:开放世界物体探测的目的是探测培训数据对象类别中不存在的物体,这是没有明确监督的未知对象。此外,在对未知对象的相应说明进行递增时,必须查明未知对象的确切类别,而不要灾难性地忘记以前已知的类别。在本文中,我们提出一个名为Open World DETR的两阶段培训方法,用于根据变形的DETR对开放世界物体进行探测。在第一阶段,我们先对当前附加说明的数据进行模型培训,以便从已知的类别中探测对象,同时培训一个额外的二进制分类器,将预测分类成前地或背景类别。这有助于模型建立公正的特征显示,便于在以后的进程中探测未知对象类别。在第二阶段,我们用多视角的自我标签战略和一致性限制来微调模型中特定类别的组成部分。此外,我们通过知识蒸馏和外延再现,对未知类别说明逐渐提供灾难性的遗漏。关于PASAL VOC和MS-COCO的实验结果,有助于模型建立公正的特征说明,从而在以后发现未知的世界范围内的大规模探测方法。