判读热动力学</s> (Thermodynamics of Interpretation)

Over the past few years, different types of data-driven Artificial Intelligence (AI) techniques have been widely adopted in various domains of science for generating predictive models. However, because of their black-box nature, it is crucial to establish trust in these models before accepting them as accurate. One way of achieving this goal is through the implementation of a post-hoc interpretation scheme that can put forward the reasons behind a black-box model's prediction. In this work, we propose a classical thermodynamics inspired approach for this purpose: Thermodynamically Explainable Representations of AI and other black-box Paradigms (TERP). TERP works by constructing a linear, local surrogate model that approximates the behaviour of the black-box model within a small neighborhood around the instance being explained. By employing a simple forward feature selection algorithm, TERP assigns an interpretability score to all the possible surrogate models. Compared to existing methods, TERP improves interpretability by selecting an optimal interpretation from these models by drawing simple parallels with classical thermodynamics. To validate TERP as a generally applicable method, we successfully demonstrate how it can be used to obtain interpretations of a wide range of black-box model architectures including deep learning Autoencoders, Recurrent neural networks and Convolutional neural networks applied to different domains including molecular simulations, image, and text classification respectively.

翻译：过去几年来,不同科学领域广泛采用了不同种类的数据驱动人工智能(AI)技术,以产生预测模型。然而,由于其黑箱性质,在接受这些模型为准确性之前,必须对这些模型建立信任。实现这一目标的一种方法是实施热后解释方案,这种方案可以提出黑箱模型预测背后的原因。在这项工作中,我们提议了一种典型的热动力学激励方法,用于此目的:AI和其他黑箱模型的热动力可解释表达法。由于这些模型具有黑箱性质,因此在将这些模型视为准确性之前,必须对这些模型建立信任。通过采用简单的前方特征选择算法,TERP为所有可能的黑箱模型预测提供解释性评分。与现有方法相比,TERP通过与古典热力模型的简单相似,从这些模型中选择最佳解释性解释性。要将TRPA作为普遍适用的模型方法,我们成功地展示了一种直线式、地方替代模型的模型,包括深层次的模型,我们成功地展示了它是如何被应用到深层次的图像网络的,包括深层次的系统。</s>

相关内容

MoDELS

关注 43

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

专知会员服务

15+阅读 · 2019年10月23日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日