Although recent advances in deep learning have accelerated progress on weakly supervised object localization (WSOL), it remains challenging to identify the entire extent of an object rather than only its most discriminative parts. In this paper, we propose a novel residual fine-grained attention (RFGA) module that autonomously excites the less activated regions of an object by exploiting information distributed over the channels and locations of feature maps, in combination with a residual operation. Specifically, we devise a series of mechanisms: triple-view attention representation, attention expansion, and feature calibration. Unlike other attention-based WSOL methods that learn a coarse attention map, with the same value shared across elements of a feature map, our proposed RFGA learns a fine-grained attention map by assigning a distinct attention value to each element. We validated the superiority of our proposed RFGA module by comparing it with recent methods in the literature on three datasets. Further, we analyzed the effect of each mechanism in RFGA and visualized the attention maps to gain insights.
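The pipeline described above (triple-view attention representation, attention expansion to a per-element map, and residual feature calibration) can be sketched as follows. This is a minimal illustrative sketch, not the paper's implementation: the function name `rfga_sketch` is hypothetical, and plain average pooling stands in for whatever learned projections the actual module uses.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def rfga_sketch(x):
    """Hypothetical sketch of residual fine-grained attention.

    x: feature map of shape (C, H, W).
    """
    C, H, W = x.shape
    # Triple-view attention representation: pool over the two
    # complementary axes to obtain one descriptor per view
    # (channel, height, width). Assumption: simple average pooling
    # replaces the module's learned attention representations.
    a_c = x.mean(axis=(1, 2))   # (C,)
    a_h = x.mean(axis=(0, 2))   # (H,)
    a_w = x.mean(axis=(0, 1))   # (W,)
    # Attention expansion: broadcast the three views to a full
    # (C, H, W) map so every element receives its own attention
    # value, rather than a single coarse value per channel.
    A = a_c[:, None, None] + a_h[None, :, None] + a_w[None, None, :]
    A = sigmoid(A)              # squash attention values into (0, 1)
    # Feature calibration with a residual operation: the identity
    # path keeps less activated regions from being suppressed.
    return x + A * x
```

The residual form `x + A * x` is the key point: even where the attention `A` is small, the original activation passes through unchanged, so weakly activated object regions are preserved rather than zeroed out.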