BR-NPA:提高注意可解释性的非几何高分辨率注意模式 (BR-NPA: A Non-Parametric High-Resolution Attention Model to improve the Interpretability of Attention)

The prevalence of employing attention mechanisms has brought along concerns on the interpretability of attention distributions. Although it provides insights about how a model is operating, utilizing attention as the explanation of model predictions is still highly dubious. The community is still seeking more interpretable strategies for better identifying local active regions that contribute the most to the final decision. To improve the interpretability of existing attention models, we propose a novel Bilinear Representative Non-Parametric Attention (BR-NPA) strategy that captures the task-relevant human-interpretable information. The target model is first distilled to have higher-resolution intermediate feature maps. From which, representative features are then grouped based on local pairwise feature similarity, to produce finer-grained, more precise attention maps highlighting task-relevant parts of the input. The obtained attention maps are ranked according to the activity level of the compound feature, which provides information regarding the important level of the highlighted regions. The proposed model can be easily adapted in a wide variety of modern deep models, where classification is involved. Extensive quantitative and qualitative experiments showcase more comprehensive and accurate visual explanations compared to state-of-the-art attention models and visualizations methods across multiple tasks including fine-grained image classification, few-shot classification, and person re-identification, without compromising the classification accuracy. The proposed visualization model sheds imperative light on how neural networks `pay their attention' differently in different tasks.

翻译：利用关注机制的普遍程度使人们对关注分布的可解释性产生了关切,虽然它使人们对模型的运作方式有了深刻的认识,但利用模型预测模型的解释仍然非常可疑;社区仍在寻求更可解释的战略,以更好地确定对最终决定贡献最大的地方活跃区域;为了改进现有关注模式的可解释性,我们提议了一个新颖的双线代表非定位关注(BR-NPA)战略,以捕捉与任务相关的人类解释信息;目标模型首先蒸发,以获得更高分辨率的中间特征图。从中,然后根据当地对称特征的相似性将代表性特征分组,以制作精细的、更精确的注意地图,突出与任务相关的部分;为了改进现有关注模式的可理解性,我们提出了关于突出区域重要程度的信息;提议的模型很容易在涉及分类的多种现代深度模型中进行调整;广泛的定量和定性实验,展示了与州级的对等特征特征特征特征相似的更全面和准确的直观解释,从而在不以不同视角的图像分类中,在不同的图像分类中,包括拟议的对不同的图像分类中,对不同的视觉分类,对不同的图像进行细致的分类。

相关内容

注意力机制

关注 120

Attention机制最早是在视觉图像领域提出来的，但是真正火起来应该算是google mind团队的这篇论文《Recurrent Models of Visual Attention》[14]，他们在RNN模型上使用了attention机制来进行图像分类。随后，Bahdanau等人在论文《Neural Machine Translation by Jointly Learning to Align and Translate》 [1]中，使用类似attention的机制在机器翻译任务上将翻译和对齐同时进行，他们的工作算是是第一个提出attention机制应用到NLP领域中。接着类似的基于attention机制的RNN模型扩展开始应用到各种NLP任务中。最近，如何在CNN中使用attention机制也成为了大家的研究热点。下图表示了attention研究进展的大概趋势。

剑桥大学《数据科学: 原理与实践》课程，附PPT下载

专知会员服务

54+阅读 · 2021年1月20日

Python计算导论，560页pdf，Introduction to Computing Using Python

专知会员服务

76+阅读 · 2020年5月5日