精细可视视力分类可解释注意引导网络 (Interpretable Attention Guided Network for Fine-grained Visual Classification)

Fine-grained visual classification (FGVC) is challenging but more critical than traditional classification tasks. It requires distinguishing different subcategories with the inherently subtle intra-class object variations. Previous works focus on enhancing the feature representation ability using multiple granularities and discriminative regions based on the attention strategy or bounding boxes. However, these methods highly rely on deep neural networks which lack interpretability. We propose an Interpretable Attention Guided Network (IAGN) for fine-grained visual classification. The contributions of our method include: i) an attention guided framework which can guide the network to extract discriminitive regions in an interpretable way; ii) a progressive training mechanism obtained to distill knowledge stage by stage to fuse features of various granularities; iii) the first interpretable FGVC method with a competitive performance on several standard FGVC benchmark datasets.

翻译：精细的视觉分类(FGVC)具有挑战性,但比传统分类任务更为关键,它要求区分不同亚类,并区分本类内物体固有的微妙变异。以前的工作重点是利用关注策略或捆绑框,利用多种颗粒和歧视性区域,提高特征代表能力。但是,这些方法高度依赖缺乏解释性的深层神经网络。我们建议为精细的视觉分类建立一个可解释关注引导网络(IGN)。我们的方法的贡献包括:(一) 关注引导框架,该框架可以指导网络以可解释的方式提取共聚区域;(二) 逐步建立培训机制,通过阶段提炼知识阶段,将各种颗粒的特性结合起来;(三) 第一个可解释的FGVC方法,在几个标准的FGVC基准数据集上具有竞争性性。

相关内容

注意力机制

关注 120

Attention机制最早是在视觉图像领域提出来的，但是真正火起来应该算是google mind团队的这篇论文《Recurrent Models of Visual Attention》[14]，他们在RNN模型上使用了attention机制来进行图像分类。随后，Bahdanau等人在论文《Neural Machine Translation by Jointly Learning to Align and Translate》 [1]中，使用类似attention的机制在机器翻译任务上将翻译和对齐同时进行，他们的工作算是是第一个提出attention机制应用到NLP领域中。接着类似的基于attention机制的RNN模型扩展开始应用到各种NLP任务中。最近，如何在CNN中使用attention机制也成为了大家的研究热点。下图表示了attention研究进展的大概趋势。

【CVPR2021】动态度量学习

专知会员服务

41+阅读 · 2021年3月30日

可解释高效异构图卷积网络，Interpretable and Efficient Heterogeneous Graph Convolutional Network

专知会员服务

63+阅读 · 2020年7月12日

【论文翻译】NLP注意力机制综述论文翻译，Attention, please! A Critical Review of Neural Attention Models in Natural Language Processing

专知会员服务

96+阅读 · 2020年4月18日

【上海交大】可解释CNN的对象分类，Interpretable CNNs for Object Classification

专知会员服务

54+阅读 · 2020年3月14日