There is an increasing demand for interpretation of model predictions especially in high-risk applications. Various visualization approaches have been proposed to estimate the part of input which is relevant to a specific model prediction. However, most approaches require model structure and parameter details in order to obtain the visualization results, and in general much effort is required to adapt each approach to multiple types of tasks particularly when model backbone and input format change over tasks. In this study, a simple yet effective visualization framework called PAMI is proposed based on the observation that deep learning models often aggregate features from local regions for model predictions. The basic idea is to mask majority of the input and use the corresponding model output as the relative contribution of the preserved input part to the original model prediction. For each input, since only a set of model outputs are collected and aggregated, PAMI does not require any model detail and can be applied to various prediction tasks with different model backbones and input formats. Extensive experiments on multiple tasks confirm the proposed method performs better than existing visualization approaches in more precisely finding class-specific input regions, and when applied to different model backbones and input formats. The source code will be released publicly.
翻译:对模型预测的解释需求日益增加,特别是在高风险应用中。提出了各种可视化方法,以估计与具体模型预测相关的投入部分。然而,大多数方法要求模型结构和参数细节,以便取得可视化结果,一般需要做出大量努力,使每种方法适应多种任务类型,特别是在模型主干和输入格式随任务变化时。在本研究中,根据以下观察,提出了一个简单而有效的可视化框架,称为PAMI,即深层次学习模型往往将当地区域的特征汇总起来,用于模型预测。基本想法是掩盖大部分投入,并使用相应的模型产出,作为保存的投入部分对原始模型预测的相对贡献。对于每一种投入,只要收集和汇总一套模型产出,PAMI并不需要任何模式细节,而是可以适用于不同模型主干和输入格式的各种预测任务。对多种任务的广泛实验证实,拟议的方法在更准确地查找特定类别输入区域,并在应用不同的模型主干和输入格式时,比现有的可视化方法做得更好。源代码将公开发布。