Machine learning researchers have long noted a trade-off between interpretability and prediction performance. On the one hand, traditional models are often interpretable to humans but they cannot achieve high prediction performances. At the opposite end of the spectrum, deep models can achieve state-of-the-art performances in many tasks. However, deep models' predictions are known to be uninterpretable to humans. In this paper we present a framework that shortens the gap between the two aforementioned groups of methods. Given an artificial neural network (ANN), our method finds a Gaussian process (GP) whose predictions almost match those of the ANN. As GPs are highly interpretable, we use the trained GP to explain the ANN's decisions. We use our method to explain ANNs' decisions on may datasets. The explanations provide intriguing insights about the ANNs' decisions. With the best of our knowledge, our inference formulation for GPs is the first one in which an ANN and a similarly behaving Gaussian process naturally appear. Furthermore, we examine some of the known theoretical conditions under which an ANN is interpretable by GPs. Some of those theoretical conditions are too restrictive for modern architectures. However, we hypothesize that only a subset of those theoretical conditions are sufficient. Finally, we implement our framework as a publicly available tool called GPEX. Given any pytorch feed-forward module, GPEX allows users to interpret any ANN subcomponent of the module effortlessly and without having to be involved in the inference algorithm. GPEX is publicly available online:www.github.com/Nilanjan-Ray/gpex
翻译:机器学习的研究人员长期注意到解释性和预测性能之间的权衡。 一方面, 传统模型往往可以对人类进行解释, 但不能达到高预测性能。 在频谱的相反端, 深模型可以在许多任务中达到最先进的表现。 然而, 深模型的预测已知对人类来说是无法解释的。 在本文中, 我们提出了一个框架, 缩短上述两组方法之间的差距。 由于一个人工神经网络( ANN), 我们的方法发现一个高萨进程( GP), 其预测几乎与ANN( ANN) 的预测相匹配。 由于 GP是高度可解释的, 我们使用经过训练的GP来解释AN( ANN) 的决定。 我们用我们的方法解释ANN( AN) 的决定。 根据我们的知识, 我们的GP( GG) 的推论配方是第一个让AN( AN( AN) ) 和 Subhales( CO) 过程自然出现。 此外, 我们使用经过训练的GP( EX) 的理论性框架中的一些条件, 我们使用这些理论性模型的模型的模型, 最终被我们被理解为是用来解释。