Ensemble-based modifications of the well-known SHapley Additive exPlanations (SHAP) method for the local explanation of a black-box model are proposed. The modifications aim to simplify SHAP which is computationally expensive when there is a large number of features. The main idea behind the proposed modifications is to approximate SHAP by an ensemble of SHAPs with a smaller number of features. According to the first modification, called ER-SHAP, several features are randomly selected many times from the feature set, and Shapley values for the features are computed by means of "small" SHAPs. The explanation results are averaged to get the final Shapley values. According to the second modification, called ERW-SHAP, several points are generated around the explained instance for diversity purposes, and results of their explanation are combined with weights depending on distances between points and the explained instance. The third modification, called ER-SHAP-RF, uses the random forest for preliminary explanation of instances and determining a feature probability distribution which is applied to selection of features in the ensemble-based procedure of ER-SHAP. Many numerical experiments illustrating the proposed modifications demonstrate their efficiency and properties for local explanation.
翻译:对众所周知的Shampley Additive Explations (SHAP) 当地解释黑盒模型的方法(SHAP),提出了基于连锁的修改。修改的目的是简化SHAP,如果具有大量特征,则计算成本昂贵。建议修改的主要目的是通过一组具有较少特征的 SHAP 组合组合,将 SHAP 接近 SHAP 。根据第一次修改,称为 ER-SHAP-RF, 从功能集中随机选择了多个特征,而特征的毛细值则通过“小” SHAP 方法计算。解释结果是平均的,以获得最后的毛细值。根据第二次修改,称为战争遗留爆炸物-SHAP,为了多样性的目的,在解释的场合周围产生了几个要点,其解释的结果与权重相结合,取决于点与解释实例之间的距离。第三次修改,称为ER-SHAP-RF,使用随机森林对实例进行初步解释,并确定特征概率分布,用于选择用于选择以数字模型为基础的本地程序,以展示其数字性-SHAP 。