Small inter-class and large intra-class variations are the main challenges in fine-grained visual classification: objects from different classes share visually similar structures, while objects within the same class can appear in different poses and viewpoints. Properly extracting discriminative local features (e.g., a bird's beak or a car's headlight) is therefore crucial. Most recent successes on this problem build on attention models that localize and attend to discriminative local object parts. In this work, we propose Coarse2Fine, a training method for visual attention networks that creates a differentiable path from the input space to the attended feature maps. Coarse2Fine learns an inverse mapping function from the attended feature maps back to the informative regions of the raw image, which guides the attention maps to better attend to fine-grained features. We show that Coarse2Fine, combined with orthogonal initialization of the attention weights, surpasses state-of-the-art accuracies on common fine-grained classification tasks.
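The abstract names orthogonal initialization of the attention weights as one ingredient. A minimal sketch of such an initializer is given below in NumPy, using the QR decomposition of a random Gaussian matrix; the shapes are hypothetical, and in practice a framework built-in such as `torch.nn.init.orthogonal_` would be used on the attention layers:

```python
import numpy as np

def orthogonal_init(rows, cols, seed=0):
    """Return a (rows, cols) weight matrix whose rows (or columns,
    whichever dimension is smaller) are orthonormal, obtained from the
    QR decomposition of a random Gaussian matrix."""
    rng = np.random.default_rng(seed)
    a = rng.standard_normal((max(rows, cols), min(rows, cols)))
    q, r = np.linalg.qr(a)
    # Fix the sign ambiguity of QR so the result is uniformly distributed.
    q = q * np.sign(np.diag(r))
    return q.T if rows < cols else q  # final shape: (rows, cols)

# Hypothetical example: 4 attention maps over 8-dimensional features.
W = orthogonal_init(4, 8)
print(np.allclose(W @ W.T, np.eye(4)))  # rows are orthonormal
```

Orthogonal rows make the initial attention maps mutually decorrelated, which encourages different maps to attend to different object parts rather than collapsing onto the same region.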