Human beings not only have the ability to recognize novel unseen classes, but also can incrementally incorporate the new classes to existing knowledge preserved. However, zero-shot learning models assume that all seen classes should be known beforehand, while incremental learning models cannot recognize unseen classes. This paper introduces a novel and challenging task of Incrementally Zero-Shot Detection (IZSD), a practical strategy for both zero-shot learning and class-incremental learning in real-world object detection. An innovative end-to-end model -- IZSD-EVer was proposed to tackle this task that requires incrementally detecting new classes and detecting the classes that have never been seen. Specifically, we propose a novel extreme value analyzer to detect objects from old seen, new seen, and unseen classes, simultaneously. Additionally and technically, we propose two innovative losses, i.e., background-foreground mean squared error loss alleviating the extreme imbalance of the background and foreground of images, and projection distance loss aligning the visual space and semantic spaces of old seen classes. Experiments demonstrate the efficacy of our model in detecting objects from both the seen and unseen classes, outperforming the alternative models on Pascal VOC and MSCOCO datasets.
翻译:人类不仅有能力认识新颖的不可见阶级,而且还可以逐步将新课程纳入保存的现有知识中。然而,零点学习模式假定所有被看见的班级都应事先为人所知,而渐进式学习模式则无法同时认识不可见班级。本文介绍了一个新颖而具有挑战性的任务,即递增零热探测(IZSD),这是现实世界物体探测中零点学习和课堂强化学习的实用战略。一个创新的端到端模式 -- -- 提议ISSD-EVer来应对这项任务,这需要逐步发现新班级并发现从未见过的班级。具体地说,我们提出了一个新的极端价值分析器,以同时从旧的、新看的和看不见班级中探测对象。此外,在技术上,我们提出了两个创新损失,即背景前表面平均正方差损失,缓解了背景和图像表面的极端失衡,并预测距离损失,以适应旧班级的视觉空间和语义空间。实验显示我们模型在从已见和看不见班级中探测对象对象方面的效率。