现在是FLAN时间了! (It's FLAN time! Summing feature-wise latent representations for interpretability)

Interpretability has become a necessary feature for machine learning models deployed in critical scenarios, e.g. legal systems, healthcare. In these situations, algorithmic decisions may have (potentially negative) long-lasting effects on the end-user affected by the decision. In many cases, the representational power of deep learning models is not needed, therefore simple and interpretable models (e.g. linear models) should be preferred. However, in high-dimensional and/or complex domains (e.g. computer vision), the universal approximation capabilities of neural networks is required. Inspired by linear models and the Kolmogorov-Arnol representation theorem, we propose a novel class of structurally-constrained neural networks, which we call FLANs (Feature-wise Latent Additive Networks). Crucially, FLANs process each input feature separately, computing for each of them a representation in a common latent space. These feature-wise latent representations are then simply summed, and the aggregated representation is used for prediction. These constraints (which are at the core of the interpretability of linear models) allow an user to estimate the effect of each individual feature independently from the others, enhancing interpretability. In a set of experiments across different domains, we show how without compromising excessively the test performance, the structural constraints proposed in FLANs indeed increase the interpretability of deep learning models.

翻译：解释性已成为在关键情景下部署的机器学习模型的一个必要特征,如法律制度、医疗保健等。在这种情况下,算法决定可能对受决定影响的最终用户产生(潜在负)长期影响。在许多情况下,不需要深层次学习模型的代表性力量,因此,应该倾向于简单和可解释的模式(如线性模型)。然而,在高维和/或复杂领域(如计算机愿景),需要建立神经网络的通用近似能力。在线性模型和科尔莫戈洛-阿尔诺尔代表理论模型的启发下,我们提议建立新型结构上不受限制的神经网络(我们称之为FLANs(Fature-witter-witter Additive网络))的新类别,因此,深层次而言之,FLANs处理每个输入特征,在共同的潜伏空间中分别计算每个特征。然后简单地概括这些特征的潜伏表,然后使用总体代表来进行预测。这些制约因素(是线性模型的核心)使得用户能够评估每个结构上受到限制的神经性网络网络,我们确实地评估了每个特性,从其他方面独立地检验了每个特性。

相关内容

MoDELS

关注 43

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【快讯】ICML 2020论文出炉，1088篇上榜，你的paper中了吗？

专知会员服务

52+阅读 · 2020年6月1日

强化学习的对比无监督表示，CURL: Contrastive Unsupervised Representations for Reinforcement Learning

专知会员服务

41+阅读 · 2020年4月11日

【上海交通大学-张拳石】可解释CNN，Interpretable CNNs for Object Classification

专知会员服务

46+阅读 · 2020年3月13日

【ICCV 2019 Toturial】Interpretable Machine Learning for Computer Vision（用于计算机视觉的可解释性机器学习）

专知会员服务

32+阅读 · 2019年10月30日