This paper tackles the problem of novel category discovery (NCD), which aims to discriminate unknown categories in large-scale image collections. The NCD task is challenging due to the closeness to the real-world scenarios, where we have only encountered some partial classes and images. Unlike other works on the NCD, we leverage the prototypes to emphasize the importance of category discrimination and alleviate the issue of missing annotations of novel classes. Concretely, we propose a novel adaptive prototype learning method consisting of two main stages: prototypical representation learning and prototypical self-training. In the first stage, we obtain a robust feature extractor, which could serve for all images with base and novel categories. This ability of instance and category discrimination of the feature extractor is boosted by self-supervised learning and adaptive prototypes. In the second stage, we utilize the prototypes again to rectify offline pseudo labels and train a final parametric classifier for category clustering. We conduct extensive experiments on four benchmark datasets and demonstrate the effectiveness and robustness of the proposed method with state-of-the-art performance.
翻译:本文探讨了新分类发现(NCD)问题,新分类发现的目的是在大规模图像采集中歧视未知类别。新分类发现(NCD)任务具有挑战性,因为与现实世界情景的距离很近,我们只遇到部分类别和图像。与其他关于NCD的工程不同,我们利用原型强调类别歧视的重要性,缓解新类缺失的注释问题。具体地说,我们提出一种新的适应性原型学习方法,由两个主要阶段组成:原型代表性学习和原型自我培训。在第一阶段,我们获得一个强健的地物提取器,它可以为所有基础和新类别图像服务。通过自我监督的学习和适应性原型,提高了地物提取器的这种实例和类别歧视能力。在第二阶段,我们再次利用原型来纠正离线假标签,为分类组合培训最后的参数分类器。我们在四个基准数据集上进行了广泛的实验,并展示了拟议方法在状态性表现方面的有效性和稳健性。