The capability of reliably detecting out-of-distribution samples is one of the key factors in deploying a good classifier, as the test distribution always does not match with the training distribution in most real-world applications. In this work, we present a deep multi-class data description, termed as Deep-MCDD, which is effective to detect out-of-distribution (OOD) samples as well as classify in-distribution (ID) samples. Unlike the softmax classifier that only focuses on the linear decision boundary partitioning its latent space into multiple regions, our Deep-MCDD aims to find a spherical decision boundary for each class which determines whether a test sample belongs to the class or not. By integrating the concept of Gaussian discriminant analysis into deep neural networks, we propose a deep learning objective to learn class-conditional distributions that are explicitly modeled as separable Gaussian distributions. Thereby, we can define the confidence score by the distance of a test sample from each class-conditional distribution, and utilize it for identifying OOD samples. Our empirical evaluation on multi-class tabular and image datasets demonstrates that Deep-MCDD achieves the best performances in distinguishing OOD samples while showing the classification accuracy as high as the other competitors.
翻译:在这项工作中,我们提出了一个深层次的多级数据描述,称为Deep-MCDD, 用于探测分配之外的样本,以及分类分布(ID)样本。与仅侧重于线性决定边界将潜在空间分割到多个区域的软式马克斯分类器不同,我们的深色MCDD旨在为每一类找到一个确定试样是否属于该类的球类决定边界,确定试样是否属于该类的样本。我们通过将高斯对异相分析的概念纳入深层神经网络,我们提出了一个深层次学习目标,以学习明确模拟为分解分布的分类条件分布。因此,我们可以根据每个类分布的测试样品距离确定信任度分数,并利用它来确定ODD样本。我们对多级表和图像分析的实证评价,表明深色MC的精确性能,同时显示其他样本的精确性能。