The completeness axiom renders the explanation of a post-hoc XAI method only locally faithful to the model, i.e. for a single decision. For the trustworthy application of XAI, in particular for high-stake decisions, a more global model understanding is required. Recently, concept-based methods have been proposed, which are however not guaranteed to be bound to the actual model reasoning. To circumvent this problem, we propose Multi-dimensional Concept Discovery (MCD) as an extension of previous approaches that fulfills a completeness relation on the level of concepts. Our method starts from general linear subspaces as concepts and does neither require reinforcing concept interpretability nor re-training of model parts. We propose sparse subspace clustering to discover improved concepts and fully leverage the potential of multi-dimensional subspaces. MCD offers two complementary analysis tools for concepts in input space: (1) concept activation maps, that show where a concept is expressed within a sample, allowing for concept characterization through prototypical samples, and (2) concept relevance heatmaps, that decompose the model decision into concept contributions. Both tools together enable a detailed understanding of the model reasoning, which is guaranteed to relate to the model via a completeness relation. This paves the way towards more trustworthy concept-based XAI. We empirically demonstrate the superiority of MCD against more constrained concept definitions.
翻译:完整性轴使对超常 XAI 后方法的解释只能是当地对模型的忠实解释,即只对单一决定。对于XAI 的可信赖应用,特别是对于高层决策来说,需要一种更全球性的模型理解。最近,提出了基于概念的方法,但不能保证与实际模型推理相联。为绕过这个问题,我们提议多维概念发现(MCD)作为以前方法的延伸,该方法满足概念水平的完整性关系。我们的方法从一般线性子空间开始,作为概念,不需要加强概念的解释性,也不需要对模型部分进行再培训。我们建议稀疏的子空间集群,以发现改进的概念并充分利用多维次空间的潜力。MCD为输入空间的概念提供了两种辅助性分析工具:(1) 概念激活图,该图显示在抽样中表达的概念,允许通过典型样本对概念进行定性,和(2) 概念相关性的热测图,将模型决定分解成概念贡献。这两种工具一起能够详细理解模型推理,而模型推理学则更可靠地与MI 概念相联系。我们保证通过更可靠的方式展示了模型的完整性。