SHAP得分解释的复杂性：通过知识编译的可处理性和非逼近性结果来确定复杂性 (On the Complexity of SHAP-Score-Based Explanations: Tractability via Knowledge Compilation and Non-Approximability Results)

In Machine Learning, the $\mathsf{SHAP}$-score is a version of the Shapley value that is used to explain the result of a learned model on a specific entity by assigning a score to every feature. While in general computing Shapley values is an intractable problem, we prove a strong positive result stating that the $\mathsf{SHAP}$-score can be computed in polynomial time over deterministic and decomposable Boolean circuits. Such circuits are studied in the field of Knowledge Compilation and generalize a wide range of Boolean circuits and binary decision diagrams classes, including binary decision trees and Ordered Binary Decision Diagrams (OBDDs). We also establish the computational limits of the SHAP-score by observing that computing it over a class of Boolean models is always polynomially as hard as the model counting problem for that class. This implies that both determinism and decomposability are essential properties for the circuits that we consider. It also implies that computing $\mathsf{SHAP}$-scores is intractable as well over the class of propositional formulas in DNF. Based on this negative result, we look for the existence of fully-polynomial randomized approximation schemes (FPRAS) for computing $\mathsf{SHAP}$-scores over such class. In contrast to the model counting problem for DNF formulas, which admits an FPRAS, we prove that no such FPRAS exists for the computation of $\mathsf{SHAP}$-scores. Surprisingly, this negative result holds even for the class of monotone formulas in DNF. These techniques can be further extended to prove another strong negative result: Under widely believed complexity assumptions, there is no polynomial-time algorithm that checks, given a monotone DNF formula $\varphi$ and features $x,y$, whether the $\mathsf{SHAP}$-score of $x$ in $\varphi$ is smaller than the $\mathsf{SHAP}$-score of $y$ in $\varphi$.

翻译：---- 在机器学习中，$\mathsf{SHAP}$得分是Shapley值的一个版本，用于解释关于特定实例的学习模型结果，通过为每个特征分配一个得分。虽然通常计算Shapley值是一个难以处理的问题，但我们证明了一个强大的正面结果，即可以在确定性和可分解布尔电路上通过多项式时间计算$\mathsf{SHAP}$得分。这些电路在知识编译领域中研究，广泛涵盖各种布尔电路和二叉决策图类，包括二叉决策树和有序二叉决策图（OBDD）。我们还通过观察计算该得分在某些Boolean模型上总是多项式时间困难证明了SHAP得分的计算限制。这意味着决定性和可分离性是我们考虑的电路必需的属性。这也意味着，即使在DNF范式的命题公式类中，计算$\mathsf{SHAP}$得分也是困难的。基于这个负面结果，我们寻找计算此类得分的完全多项式随机逼近方案（FPRAS）的存在。与DNF公式的模型计数问题相比，其接受了一个FPRAS，我们证明了没有FPRAS存在，用于计算$\mathsf{SHAP}$得分。令人惊讶的是，即使对于DNF中基本单调函数的类，这个负面结果也成立。这些技术可以进一步扩展为证明另一个强大的负面结果：在广泛认为的复杂性假设下，不存在一种多项式时间算法，它可以检查给定的单调DNF公式$\varphi$和特征$x$，$y$的$\mathsf{SHAP}$得分是否小于$\varphi$中$y$的$\mathsf{SHAP}$得分。