Feature attributions are a common paradigm for model explanations due to their simplicity in assigning a single numeric score for each input feature to a model. In the actionable recourse setting, wherein the goal of the explanations is to improve outcomes for model consumers, it is often unclear how feature attributions should be correctly used. With this work, we aim to strengthen and clarify the link between actionable recourse and feature attributions. Concretely, we propose a variant of SHAP, Counterfactual SHAP (CF-SHAP), that incorporates counterfactual information to produce a background dataset for use within the marginal (a.k.a. interventional) Shapley value framework. We motivate the need within the actionable recourse setting for careful consideration of background datasets when using Shapley values for feature attributions with numerous synthetic examples. Moreover, we demonstrate the efficacy of CF-SHAP by proposing and justifying a quantitative score for feature attributions, counterfactual-ability, showing that as measured by this metric, CF-SHAP is superior to existing methods when evaluated on public datasets using tree ensembles.
翻译:特性属性是模型解释的一个常见范例,因为对模型的每个输入特性指定一个单一数字分数是简单的。在可操作的追索环境中,解释的目的是改善模型消费者的结果,在可操作的追索环境中,往往不清楚应如何正确使用特性属性;在这项工作中,我们的目标是加强和澄清可操作的追索和特性属性之间的联系。具体地说,我们提出了SHAP的变式,即CF-SHAP(反事实 SHAP),它包括反事实信息,以产生背景数据集,供边际(a.k.a.干涉性)弱点值框架使用。我们在可操作的追索环境中,在使用图示值作为特征属性属性时,需要仔细考虑背景数据集,并举许多合成例子。此外,我们通过提出和说明特征属性属性的定量分数,反事实可性,表明CF-SHAP在使用树套来评价公共数据集时,比现有的方法要好。