Shapley values, which were originally designed to assign attributions to individual players in coalition games, have become a commonly used approach in explainable machine learning to provide attributions to input features for black-box machine learning models. A key attraction of Shapley values is that they uniquely satisfy a very natural set of axiomatic properties. However, extending the Shapley value to assigning attributions to interactions rather than individual players, an interaction index, is non-trivial: as the natural set of axioms for the original Shapley values, extended to the context of interactions, no longer specify a unique interaction index. Many proposals thus introduce additional less ''natural'' axioms, while sacrificing the key axiom of efficiency, in order to obtain unique interaction indices. In this work, rather than introduce additional conflicting axioms, we adopt the viewpoint of Shapley values as coefficients of the most faithful linear approximation to the pseudo-Boolean coalition game value function. By extending linear to $\ell$-order polynomial approximations, we can then define the general family of faithful interaction indices. We show that by additionally requiring the faithful interaction indices to satisfy interaction-extensions of the standard individual Shapley axioms (dummy, symmetry, linearity, and efficiency), we obtain a unique Faithful Shapley Interaction index, which we denote Faith-Shap, as a natural generalization of the Shapley value to interactions. We then provide some illustrative contrasts of Faith-Shap with previously proposed interaction indices, and further investigate some of its interesting algebraic properties. We further show the computational efficiency of computing Faith-Shap, together with some additional qualitative insights, via some illustrative experiments.
翻译:Shapley值最初是为了分配合作博弈中每个玩家的出众表现而设计的,现在已经成为黑盒机器学习模型可解释性的常见方法之一,用于为输入特征提供归属。Shapley值的一个关键优点是,它们唯一满足一组非常自然的公理性质。然而,将Shapley值扩展到为交互作用而不是个别玩家分配归属的交互作用指数是非常棘手的:由于原始Shapley值的自然公理集在交互作用的上下文中不再指定唯一的交互作用指数。因此,许多提议引入附加的非“自然”公理,而牺牲效率这一关键公理,则可以获得唯一的交互作用指数。在这项工作中,我们不是引入额外的冲突公理,而是采用Shapley值作为伪布尔联盟博弈价值函数的最忠实线性近似系数的观点。通过将线性推广到$\ell$-阶多项式近似,我们可以定义忠实交互作用指数的通用系列。我们展示了通过额外要求忠实交互作用指数满足标准个人Shapley公理(虚拟、对称、线性和效率)的交互作用扩展,得到了唯一的忠实Shapley交互作用指数,我们将其称为Faith-Shap。然后,我们通过一些说明性实验进一步研究了Faith-Shap与之前提出的交互作用指数的一些对比,以及它的一些有趣的代数属性。我们还展示了计算Faith-Shap的计算效率,以及通过一些说明性实验的一些附加定性洞察。