Local feature attribution methods are increasingly used to explain complex machine learning models. However, current methods are limited because they are extremely expensive to compute or are not capable of explaining a distributed series of models where each model is owned by a separate institution. The latter is particularly important because it often arises in finance where explanations are mandated. Here, we present DeepSHAP, a tractable method to propagate local feature attributions through complex series of models based on a connection to the Shapley value. We evaluate DeepSHAP across biological, health, and financial datasets to show that it provides equally salient explanations an order of magnitude faster than existing model-agnostic attribution techniques and demonstrate its use in an important distributed series of models setting.
翻译:本地特性归属方法越来越多地用于解释复杂的机器学习模式,但是,目前的方法是有限的,因为它们计算费用极其昂贵,或者无法解释每个模型都由单独机构拥有的分布式系列模型,后者特别重要,因为它往往在财务中产生,而财务中要求解释。这里,我们介绍了DeepSHAP,这是通过基于与Shapley值连接的复杂系列模型传播本地特性属性的一种可移植方法。我们评估了不同生物、健康和财务数据集的深层SHAP,以表明它所提供的同样显著的解释,其数量级比现有的模型-不可知性归属技术要快,并展示其在重要的分布式系列模型设置中的使用情况。