Convolutional neural network (CNN) models for computer vision are powerful but lack explainability in their most basic form. This deficiency remains a key challenge when applying CNNs in high-stakes domains. Recent work on explanation via the feature importance of approximate linear models has moved from input-level features (pixels or segments) to features drawn from mid-layer feature maps, in the form of concept activation vectors (CAVs). CAVs carry concept-level information and can be learned via clustering. In this work, we rethink the ACE algorithm of Ghorbani et al. and propose an alternative invertible concept-based explanation (ICE) framework that overcomes its shortcomings. Guided by the requirements of fidelity (how closely the approximate model matches the target model) and interpretability (how meaningful the concepts are to people), we design measures and use them to evaluate a range of matrix factorization methods within our framework. We find that \emph{non-negative concept activation vectors} (NCAVs), obtained from non-negative matrix factorization (NMF), provide superior interpretability and fidelity in both computational and human-subject experiments. Our framework provides both local and global concept-level explanations for pre-trained CNN models.
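To make the NCAV idea concrete, the following is a minimal sketch (not the authors' released code) of how non-negative matrix factorization could be applied to post-ReLU mid-layer feature maps to obtain concept directions and spatial concept scores; the function and parameter names (ncavs_from_feature_maps, n_concepts) are illustrative, and scikit-learn's NMF is assumed as the factorizer.

```python
# Minimal sketch: deriving non-negative concept activation vectors (NCAVs)
# by factorizing mid-layer CNN feature maps with NMF.
import numpy as np
from sklearn.decomposition import NMF

def ncavs_from_feature_maps(feature_maps: np.ndarray, n_concepts: int = 10):
    """feature_maps: (n_images, h, w, c) post-ReLU activations (non-negative)."""
    n, h, w, c = feature_maps.shape
    V = feature_maps.reshape(n * h * w, c)      # one row per spatial position
    nmf = NMF(n_components=n_concepts, init="nndsvda", max_iter=500)
    S = nmf.fit_transform(V)                    # concept scores per spatial position
    P = nmf.components_                         # (n_concepts, c) NCAV directions
    scores = S.reshape(n, h, w, n_concepts)     # spatial concept score maps per image
    return scores, P
```

Under this reading, the score maps can be upsampled to the input resolution for local explanations, while the rows of P act as global concept directions in the chosen layer's channel space.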