One of the most popular methods of the machine learning prediction explanation is the SHapley Additive exPlanations method (SHAP). An imprecise SHAP as a modification of the original SHAP is proposed for cases when the class probability distributions are imprecise and represented by sets of distributions. The first idea behind the imprecise SHAP is a new approach for computing the marginal contribution of a feature, which fulfils the important efficiency property of Shapley values. The second idea is an attempt to consider a general approach to calculating and reducing interval-valued Shapley values, which is similar to the idea of reachable probability intervals in the imprecise probability theory. A simple special implementation of the general approach in the form of linear optimization problems is proposed, which is based on using the Kolmogorov-Smirnov distance and imprecise contamination models. Numerical examples with synthetic and real data illustrate the imprecise SHAP.
翻译:机器学习预测解释最流行的方法之一是Shamapley Additive Explanations(SHAP) 。对于班级概率分布不精确且由数组分布代表的分类概率分布不精确的情况,建议对原SHAP进行不精确的修改。不精确 SHAP的第一个想法是计算某一特性的边际贡献的新方法,该特性满足了Shapley价值的重要效率特性。第二个想法是试图考虑一种计算和减少间隔值的Shapley值的一般方法,该方法类似于不精确概率理论中可达概率间隔的构想。提出了以线性优化问题为形式的简单特别实施一般方法,其形式是使用Kolmogorov-Smirnov的距离和不精确的污染模型。合成和真实数据中的数字实例说明了不精确的SHAP。