We address the problem of generalized category discovery (GCD) in this paper, i.e. clustering the unlabeled images leveraging the information from a set of seen classes, where the unlabeled images could contain both seen classes and unseen classes. The seen classes can be seen as an implicit criterion of classes, which makes this setting different from unsupervised clustering where the cluster criteria may be ambiguous. We mainly concern the problem of discovering categories within a fine-grained dataset since it is one of the most direct applications of category discovery, i.e. helping experts discover novel concepts within an unlabeled dataset using the implicit criterion set forth by the seen classes. State-of-the-art methods for generalized category discovery leverage contrastive learning to learn the representations, but the large inter-class similarity and intra-class variance pose a challenge for the methods because the negative examples may contain irrelevant cues for recognizing a category so the algorithms may converge to a local-minima. We present a novel method called Expert-Contrastive Learning (XCon) to help the model to mine useful information from the images by first partitioning the dataset into sub-datasets using k-means clustering and then performing contrastive learning on each of the sub-datasets to learn fine-grained discriminative features. Experiments on fine-grained datasets show a clear improved performance over the previous best methods, indicating the effectiveness of our method.
翻译:在本文中,我们处理通用分类发现问题,即利用一组可见分类中的信息,将未贴标签的图像集中起来,利用一组可见分类中的信息,使未贴标签的图像能够包含可见类和不可见类。可见类可以被视为一个隐含的分类标准,这使得这种设置不同于未监督的群集,因为群集标准可能模棱两可。我们主要关注在细微分类数据集中发现类别的问题,因为它是分类发现的最直接应用之一,即帮助专家利用被观察类中设定的隐含标准,在未贴标签的数据集中发现新概念。通用类别发现的国家艺术方法利用对比学习的学习方法,学习代表性,但大类间相似性和类内差异对方法提出了挑战,因为负面例子可能包含与识别某个类别无关的提示,因此算法可能与本地最小化相融合。我们提出了一个名为专家-融合学习(XCon)的新方法,帮助专家从一个未贴标签的数据集中找到新的概念,先将数据分组分析,然后用最精确的方法将数据分解,然后用最精确的方法显示每个分析性数据,然后用最精确的方法,用最精确的方法来学习。