GPEX, 人工神经网络解释框架 (GPEX, A Framework For Interpreting Artificial Neural Networks)

Machine learning researchers have long noted a trade-off between interpretability and prediction performance. On the one hand, traditional models are often interpretable to humans but they cannot achieve high prediction performances. At the opposite end of the spectrum, deep models can achieve state-of-the-art performances in many tasks. However, deep models' predictions are known to be uninterpretable to humans. In this paper we present a framework that shortens the gap between the two aforementioned groups of methods. Given an artificial neural network (ANN), our method finds a Gaussian process (GP) whose predictions almost match those of the ANN. As GPs are highly interpretable, we use the trained GP to explain the ANN's decisions. We use our method to explain ANNs' decisions on may datasets. The explanations provide intriguing insights about the ANNs' decisions. With the best of our knowledge, our inference formulation for GPs is the first one in which an ANN and a similarly behaving Gaussian process naturally appear. Furthermore, we examine some of the known theoretical conditions under which an ANN is interpretable by GPs. Some of those theoretical conditions are too restrictive for modern architectures. However, we hypothesize that only a subset of those theoretical conditions are sufficient. Finally, we implement our framework as a publicly available tool called GPEX. Given any pytorch feed-forward module, GPEX allows users to interpret any ANN subcomponent of the module effortlessly and without having to be involved in the inference algorithm. GPEX is publicly available online:www.github.com/Nilanjan-Ray/gpex

翻译：机器学习的研究人员长期注意到解释性和预测性能之间的权衡。一方面, 传统模型往往可以对人类进行解释, 但不能达到高预测性能。在频谱的相反端, 深模型可以在许多任务中达到最先进的表现。然而, 深模型的预测已知对人类来说是无法解释的。在本文中, 我们提出了一个框架, 缩短上述两组方法之间的差距。由于一个人工神经网络( ANN), 我们的方法发现一个高萨进程( GP), 其预测几乎与ANN( ANN) 的预测相匹配。由于 GP是高度可解释的, 我们使用经过训练的GP来解释AN( ANN) 的决定。我们用我们的方法解释ANN( AN) 的决定。根据我们的知识, 我们的GP( GG) 的推论配方是第一个让AN( AN( AN) ) 和 Subhales( CO) 过程自然出现。此外, 我们使用经过训练的GP( EX) 的理论性框架中的一些条件, 我们使用这些理论性模型的模型的模型, 最终被我们被理解为是用来解释。

相关内容

人工神经网络

关注 131

人工神经网络（Artificial Neural Network，即ANN），它从信息处理角度对人脑神经元网络进行抽象，建立某种简单模型，按不同的连接方式组成不同的网络。在工程与学术界也常直接简称为神经网络或类神经网络。神经网络是一种运算模型，由大量的节点（或称神经元）之间相互联接构成。每个节点代表一种特定的输出函数，称为激励函数（activation function）。每两个节点间的连接都代表一个对于通过该连接信号的加权值，称之为权重，这相当于人工神经网络的记忆。网络的输出则依网络的连接方式，权重值和激励函数的不同而不同。而网络自身通常都是对自然界某种算法或者函数的逼近，也可能是对一种逻辑策略的表达。

专知会员服务

39+阅读 · 2020年11月3日

神经常微分方程教程，50页ppt，A brief tutorial on Neural ODEs

专知会员服务

74+阅读 · 2020年8月2日

【ICML2020】深度神经网络置信感知学习，Conﬁdence-Aware Learning for Deep Neural Networks

专知会员服务

74+阅读 · 2020年7月6日

【CHI2020-微软】解释可解释性:理解数据科学家使用机器学习的可解释性工具，Interpreting Interpretability: Understanding Data Scientists’Use of Interpretability Tools for Machine Learning

专知会员服务

55+阅读 · 2020年3月8日

《可解释的机器学习-interpretable-ml》238页pdf