Current state-of-the-art Wildlife classification models are trained under the closed world setting. When exposed to unknown classes, they remain overconfident in their predictions. Open-set Recognition (OSR) aims to classify known classes while rejecting unknown samples. Several OSR methods have been proposed to model the closed-set distribution by observing the feature, logit, or softmax probability space. A significant drawback of many existing approaches is the requirement to retrain the pre-trained classification model with the OSR-specific strategy. This study contributes a post-processing OSR method that measures the agreement between the models' features and predicted logits. We propose a probability distribution based on an input's distance to its Nearest Class Mean (NCM). The NCM-based distribution is then compared with the softmax probabilities from the logit space to measure agreement between the NCM and the classification head. Our proposed strategy ranks within the top three on two evaluated datasets, showing consistent performance across the two datasets. In contrast, current state-of-the-art methods excel on a single dataset. We achieve an AUROC of 93.41 and 95.35 for African and Swedish animals. The code can be found https://github.com/Applied-Representation-Learning-Lab/OSR.
翻译:当前最先进的野生动物分类模型均在封闭世界设定下训练。当面对未知类别时,这些模型仍会对其预测表现出过度自信。开放集识别旨在分类已知类别的同时拒绝未知样本。已有多种OSR方法通过观察特征空间、逻辑值空间或softmax概率空间来建模封闭集分布。现有方法的一个显著缺陷是需要使用特定于OSR的策略重新训练预训练分类模型。本研究提出一种后处理OSR方法,通过度量模型特征与预测逻辑值之间的一致性来实现开放集识别。我们提出一种基于输入样本到其最近类均值距离的概率分布,并将该NCM分布与逻辑值空间生成的softmax概率进行比较,以度量NCM分类器与分类头之间的一致性。在两个评估数据集上,我们提出的策略均位列前三,显示出跨数据集的一致性能。相比之下,当前最先进的方法仅在单个数据集上表现优异。我们在非洲和瑞典动物数据集上分别取得了93.41和95.35的AUROC值。代码可见https://github.com/Applied-Representation-Learning-Lab/OSR。