Commonly used AI networks are very self-confident in their predictions, even when the evidence for a certain decision is dubious. The investigation of a deep learning model output is pivotal for understanding its decision processes and assessing its capabilities and limitations. By analyzing the distributions of raw network output vectors, it can be observed that each class has its own decision boundary and, thus, the same raw output value has different support for different classes. Inspired by this fact, we have developed a new method for out-of-distribution detection. The method offers an explanatory step beyond simple thresholding of the softmax output towards understanding and interpretation of the model learning process and its output. Instead of assigning the class label of the highest logit to each new sample presented to the network, it takes the distributions over all classes into consideration. A probability score interpreter (PSI) is created based on the joint logit values in relation to their respective correct vs wrong class distributions. The PSI suggests whether the sample is likely to belong to a specific class, whether the network is unsure, or whether the sample is likely an outlier or unknown type for the network. The simple PSI has the benefit of being applicable on already trained networks. The distributions for correct vs wrong class for each output node are established by simply running the training examples through the trained network. We demonstrate our OOD detection method on a challenging transmission electron microscopy virus image dataset. We simulate a real-world application in which images of virus types unknown to a trained virus classifier, yet acquired with the same procedures and instruments, constitute the OOD samples.
翻译:通常使用的 AI 网络在预测中非常自信, 即使某些决定的证据可疑。 调查深学习模型输出对于理解其决策过程和评估其能力和局限性至关重要。 通过分析原始网络输出矢量的分布, 可以观察到, 每个类都有自己的决定界限, 因此, 不同的类都有不同的原始产出值支持。 受这个事实的启发, 我们开发了一种新的超出分配范围的检测方法。 这个方法为理解和解释模型学习过程及其输出提供了一个解释性步骤, 超越软模输出的简单临界值, 从而理解和解释模型学习过程及其输出。 深度学习模型输出对于理解和解释至关重要。 向网络提供的每个新样本指定最高日志的分类标签, 而不是将所有分类的分布考虑在内。 概率评分( PSI ) 是根据它们各自的正确和错误的分类分布值来创建的。 PSI 表明样本是否属于某个特定类, 网络是否不确定, 或样本可能是网络的外部或未知类型 。 简单的 OSI 将我们所训练的机变的机 。 通过我们所训练的机的机变的机变的机变的机法 。 显示我们所训练的机变的机变的机变的机的机的机变的机变的机 。 。 我们的机的机的机变的机变的机变的机变的机变的机变的机变的机变的机变的机变的机变的机变的机变的机变的机。