Saliency methods generating visual explanatory maps representing the importance of image pixels for model classification is a popular technique for explaining neural network decisions. Hierarchical dynamic masks (HDM), a novel explanatory maps generation method, is proposed in this paper to enhance the granularity and comprehensiveness of saliency maps. First, we suggest the dynamic masks (DM), which enables multiple small-sized benchmark mask vectors to roughly learn the critical information in the image through an optimization method. Then the benchmark mask vectors guide the learning of large-sized auxiliary mask vectors so that their superimposed mask can accurately learn fine-grained pixel importance information and reduce the sensitivity to adversarial perturbations. In addition, we construct the HDM by concatenating DM modules. These DM modules are used to find and fuse the regions of interest in the remaining neural network classification decisions in the mask image in a learning-based way. Since HDM forces DM to perform importance analysis in different areas, it makes the fused saliency map more comprehensive. The proposed method outperformed previous approaches significantly in terms of recognition and localization capabilities when tested on natural and medical datasets.
翻译:显示显示图像像素对模型分类重要性的视觉解释性图象的清晰度方法,是解释神经网络决定的一种流行技术。本文件建议采用等级动态遮罩(HDM)这一新型的解释性地图生成方法,以加强显要地图的颗粒性和全面性。首先,我们建议采用动态遮罩(DM),使多个小型基准遮罩矢量能够通过优化方法对图像中的关键信息进行粗略了解。然后,基准遮罩矢量指导大型辅助遮罩的学习,以便其超大型辅助遮罩能够准确地了解细微像素重要性信息,并降低对对抗性扰动的敏感度。此外,我们通过配置DM模块来构建HDM。这些DM模块用来查找和整合在以学习为基础的蒙面图像中其余的神经网络分类决定中感兴趣的区域。由于HDM要求DM在不同区域进行重要分析,因此使引信突出的地图更加全面。拟议方法在自然和医学数据设置测试时,在识别和本地化能力方面大大超越了先前的方法。