The black-box nature of Deep Neural Networks (DNNs) severely hinders its performance improvement and application in specific scenes. In recent years, class activation mapping-based method has been widely used to interpret the internal decisions of models in computer vision tasks. However, when this method uses backpropagation to obtain gradients, it will cause noise in the saliency map, and even locate features that are irrelevant to decisions. In this paper, we propose an Absolute value Class Activation Mapping-based (Abs-CAM) method, which optimizes the gradients derived from the backpropagation and turns all of them into positive gradients to enhance the visual features of output neurons' activation, and improve the localization ability of the saliency map. The framework of Abs-CAM is divided into two phases: generating initial saliency map and generating final saliency map. The first phase improves the localization ability of the saliency map by optimizing the gradient, and the second phase linearly combines the initial saliency map with the original image to enhance the semantic information of the saliency map. We conduct qualitative and quantitative evaluation of the proposed method, including Deletion, Insertion, and Pointing Game. The experimental results show that the Abs-CAM can obviously eliminate the noise in the saliency map, and can better locate the features related to decisions, and is superior to the previous methods in recognition and localization tasks.
翻译:深神经网络(DNNS)的黑箱性质严重妨碍其性能的改进和在特定场景中的应用。近年来,基于阶级激活映像法被广泛用于解释计算机视觉任务中模型的内部决定。然而,当该方法使用回映来获取梯度时,它将在突出图中引起噪音,甚至定位与决定无关的特征。在本文件中,我们提出了一个基于绝对值的分类定位映像法(Abs-CAM),该方法优化了从后向调整得出的梯度,并将所有梯度变为正梯度,以加强输出神经神经元激活的视觉特征,并提高突出图的本地化能力。Abs-CAM的框架将分为两个阶段:生成初始突出图,生成与决定无关的最后突出图。在第一阶段,通过优化梯度,将初始显性地图与原始图像结合起来,以加强突出性地图的语义化信息。我们从质量和定量角度评估了显著的图像特征,我们从质量和数量角度评估了与定位相关的游戏方法, 展示了先前的定位和定位方法, 展示了先前的精确度和精确度评估方法,可以消除了先前的定位。