Assuming unknown classes could be present during classification, the open set recognition (OSR) task aims to classify an instance into a known class or reject it as unknown. In this paper, we use a two-stage training strategy for the OSR problems. In the first stage, we introduce a self-supervised feature decoupling method that finds the content features of the input samples from the known classes. Specifically, our feature decoupling approach learns a representation that can be split into content features and transformation features. In the second stage, we fine-tune the content features with the class labels. The fine-tuned content features are then used for the OSR problems. Moreover, we consider an unsupervised OSR scenario, where we cluster the content features learned from the first stage. To measure representation quality, we introduce intra-inter ratio (IIR). Our experimental results indicate that our proposed self-supervised approach outperforms others in image and malware OSR problems. Also, our analyses indicate that IIR is correlated with OSR performance.
翻译:假设分类期间可能存在未知类别,开放式识别(OSR)任务的目的是将一个实例分类为已知类别,或拒绝将其列为未知类别。在本文中,我们针对OSR问题采用两阶段培训战略。在第一阶段,我们采用自监督特征分离方法,从已知类别中找到输入样本的内容特征。具体地说,我们的特征脱钩方法学会了可以分为内容特征和变异特征的表达方式。在第二阶段,我们用类标签微调内容特征。随后,微调内容特征用于处理OSR问题。此外,我们考虑一种不受监督的 OSR情景,我们在此将第一阶段学到的内容组合在一起。为了衡量代表性质量,我们引入内部比例。我们的实验结果表明,我们提议的自监督方法在图像和恶意 OSR问题中比其他人更相容。此外,我们的分析表明,IIR与OSR性能相关。