This paper presents a principled approach for detecting out-of-distribution (OOD) samples in deep neural networks (DNN). Modeling probability distributions on deep features has recently emerged as an effective, yet computationally cheap method to detect OOD samples in DNN. However, the features produced by a DNN at any given layer do not fully occupy the corresponding high-dimensional feature space. We apply linear statistical dimensionality reduction techniques and nonlinear manifold-learning techniques on the high-dimensional features in order to capture the true subspace spanned by the features. We hypothesize that such lower-dimensional feature embeddings can mitigate the curse of dimensionality, and enhance any feature-based method for more efficient and effective performance. In the context of uncertainty estimation and OOD, we show that the log-likelihood score obtained from the distributions learnt on this lower-dimensional subspace is more discriminative for OOD detection. We also show that the feature reconstruction error, which is the $L_2$-norm of the difference between the original feature and the pre-image of its embedding, is highly effective for OOD detection and in some cases superior to the log-likelihood scores. The benefits of our approach are demonstrated on image features by detecting OOD images, using popular DNN architectures on commonly used image datasets such as CIFAR10, CIFAR100, and SVHN.
翻译:本文介绍了在深神经网络中探测分布(OOD)样本的原则性方法。最近,在DNN中,测得深度特征的模型概率分布作为一种有效、但计算成本低的方法,在DNN中发现OOD样本。然而,在任何特定层,DNN生成的特征并不完全占据相应的高维特征空间。我们在高维特征上应用线性统计维度减少技术和非线性多重学习技术,以捕捉这些特征所覆盖的真正子空间。我们假设,这种低维特征嵌入能够减轻维度的诅咒,并增强任何基于地貌的更高效和有效性表现方法。在不确定性估计和OOOD中,我们显示从这一低维次空间分布中获得的日志相似性评分对于OD检测来说更具歧视性。我们还表明,特征重建错误,即原始特征和其嵌入前值之间的差异为$L_2N-10美元,对于OODOD检测工作来说非常有效,在SROD模型上展示了我们用于SARD图像的高级特征。