现在是FLAN时间了! (It's FLAN time! Summing feature-wise latent representations for interpretability)

Interpretability has become a necessary feature for machine learning models deployed in critical scenarios, e.g. legal system, healthcare. In these situations, algorithmic decisions may have (potentially negative) long-lasting effects on the end-user affected by the decision. In many cases, the representational power of deep learning models is not needed, therefore simple and interpretable models (e.g. linear models) should be preferred. However, in high-dimensional and/or complex domains (e.g. computer vision), the universal approximation capabilities of neural networks are required. Inspired by linear models and the Kolmogorov-Arnold representation theorem, we propose a novel class of structurally-constrained neural networks, which we call FLANs (Feature-wise Latent Additive Networks). Crucially, FLANs process each input feature separately, computing for each of them a representation in a common latent space. These feature-wise latent representations are then simply summed, and the aggregated representation is used for prediction. These constraints (which are at the core of the interpretability of linear models) allow a user to estimate the effect of each individual feature independently from the others, enhancing interpretability. In a set of experiments across different domains, we show how without compromising excessively the test performance, the structural constraints proposed in FLANs indeed facilitates the interpretability of deep learning models. We quantitatively compare FLANs interpretability to post-hoc methods using recently introduced metrics, discussing the advantages of natively interpretable models over a post-hoc analysis.

翻译：解释性已成为在关键情景(如法律制度、医疗保健)中部署的机器学习模型的一个必要特征。在这种情况下,算法决定可能对受决定影响的终端用户产生(潜在负)长期影响。在许多情况下,不需要深层次学习模型的代表性力量,因此,应该倾向于简单和可解释的模式(如线性模型)。然而,在高维和(或)复杂领域(如计算机愿景),需要建立神经网络的通用近似能力。在线性模型和科尔莫戈洛夫-阿诺尔德代言方的启发下,我们提出一个结构上不受限制的神经网络的新型类(可能为负)长期影响。我们称之为FLANs(Fature-with Lent Additive 网络) 在许多情况下,FLANs处理每种输入特征,在共同的潜伏空间中分别计算一个代表。这些特征的潜伏性表述随后简单地加以概括,并使用总体代表来进行预测。这些制约因素(是线性模型的核心)使得用户能够对结构上受限制性进行更深层次的网络分析,而不用独立地对每个域域域进行解释。我们所拟议的弹性解释,从其他特性来独立地展示了各种可变性解释。

相关内容

MoDELS

关注 43

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【干货书】机器学习设计模式，408页pdf，Machine Learning Design Patterns

专知会员服务

138+阅读 · 2022年2月6日

对比学习简述

专知会员服务

90+阅读 · 2021年6月29日

【CHI2020-微软】解释可解释性:理解数据科学家使用机器学习的可解释性工具，Interpreting Interpretability: Understanding Data Scientists’Use of Interpretability Tools for Machine Learning

专知会员服务

55+阅读 · 2020年3月8日

【开放书】预测模型:探索、解释和调试，以人为本的可解释机器学习，Predictive Models: Explore, Explain, and Debug，Human-Centered Interpretable Machine Learning

专知会员服务

37+阅读 · 2019年12月26日