Because of their strong theoretical properties, Shapley values have become very popular as a way to explain predictions made by black box models. Unfortuately, most existing techniques to compute Shapley values are computationally very expensive. We propose PDD-SHAP, an algorithm that uses an ANOVA-based functional decomposition model to approximate the black-box model being explained. This allows us to calculate Shapley values orders of magnitude faster than existing methods for large datasets, significantly reducing the amortized cost of computing Shapley values when many predictions need to be explained.
翻译:Shapley值因其强烈的理论属性而变得非常受欢迎,成为解释黑盒模型预测的一种方法。 想不到的是,大多数计算黄盒值的现有技术都是在计算上非常昂贵的。 我们提出PDD-SHAP算法,这个算法使用基于 ANOVA 的功能分解模型来接近正在解释的黑盒模型。 这使我们能够计算比现有的大型数据集方法更快的黄盒值数量级,大大降低了在需要解释许多预测时计算黄盒值的摊销成本。