通过激活分析解释深神经分类的预测 (Explaining Predictions of Deep Neural Classifier via Activation Analysis)

In many practical applications, deep neural networks have been typically deployed to operate as a black box predictor. Despite the high amount of work on interpretability and high demand on the reliability of these systems, they typically still have to include a human actor in the loop, to validate the decisions and handle unpredictable failures and unexpected corner cases. This is true in particular for failure-critical application domains, such as medical diagnosis. We present a novel approach to explain and support an interpretation of the decision-making process to a human expert operating a deep learning system based on Convolutional Neural Network (CNN). By modeling activation statistics on selected layers of a trained CNN via Gaussian Mixture Models (GMM), we develop a novel perceptual code in binary vector space that describes how the input sample is processed by the CNN. By measuring distances between pairs of samples in this perceptual encoding space, for any new input sample, we can now retrieve a set of most perceptually similar and dissimilar samples from an existing atlas of labeled samples, to support and clarify the decision made by the CNN model. Possible uses of this approach include for example Computer-Aided Diagnosis (CAD) systems working with medical imaging data, such as Magnetic Resonance Imaging (MRI) or Computed Tomography (CT) scans. We demonstrate the viability of our method in the domain of medical imaging for patient condition diagnosis, as the proposed decision explanation method via similar ground truth domain examples (e.g. from existing diagnosis archives) will be interpretable by the operating medical personnel. Our results indicate that our method is capable of detecting distinct prediction strategies that enable us to identify the most similar predictions from an existing atlas.

翻译：在许多实际应用中,深心神经网络通常被作为一种黑盒预测器来运行。尽管在解释性方面做了大量工作,对这些系统可靠性的需求也很高,但通常仍须在循环中包括一个人类行为者,以验证决定,并处理不可预测的故障和意外的角落案例。特别是对于失败关键应用领域,如医学诊断,这是特别如此。我们提出了一个新颖的方法,解释和支持对决策过程的解释,以便由一位专家在以Culual Recial Neal网络(CNN)为基础,运行一个深层次的深入学习系统。通过Gaussian mixtures 诊断模型(GMMMM)对受过训练的CNN的选定层次进行模拟启动统计数据,我们通常在二进制矢量空间开发一个新的概念代码代码,说明输入样品是如何由CNNC处理的。通过任何新的输入抽样样本,我们现在可以从现有的标签样本中提取一组最明显相似和不相近的样本,以便支持和澄清CNN模型做出的决定。我们可能使用这种方法,例如计算机-Regional Regimax 系统,从而显示我们现有的医学方法。