Weakly supervised learning with only coarse labels can obtain visual explanations of deep neural networks, such as attention maps, by back-propagating gradients. These attention maps can then serve as priors for tasks such as object localization and semantic segmentation. Within one common framework we address three shortcomings of previous approaches to modeling such attention maps: we (1) make attention maps an explicit and natural component of end-to-end training for the first time, (2) provide self-guidance directly on these maps by exploring supervision from the network itself to improve them, and (3) seamlessly bridge the gap between using weak supervision and extra supervision when the latter is available. Despite its simplicity, experiments on the semantic segmentation task demonstrate the effectiveness of our method: we clearly surpass the state of the art on the PASCAL VOC 2012 val and test sets. Moreover, the proposed framework provides a way not only to explain the focus of the learner but also to feed back direct guidance towards specific tasks. Under mild assumptions, our method can also be understood as a plug-in to existing weakly supervised learners that improves their generalization performance.
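To make the gradient-based attention maps mentioned above concrete, the following is a minimal Grad-CAM-style sketch in PyTorch. It is only an illustration under our own assumptions (a ResNet-18 backbone, hooking the last convolutional stage, min-max normalization), not the exact procedure proposed in this paper:

```python
import torch
import torch.nn.functional as F
import torchvision.models as models

# Assumed backbone for illustration; any CNN classifier works analogously.
model = models.resnet18(weights=None).eval()

feats = {}
def hook(module, inp, out):
    # Cache the forward activation and register a hook to catch its gradient.
    feats["act"] = out
    out.register_hook(lambda g: feats.__setitem__("grad", g))

model.layer4.register_forward_hook(hook)  # last conv stage of ResNet-18

x = torch.randn(1, 3, 224, 224)          # stand-in for an input image
scores = model(x)
scores[0, scores.argmax()].backward()     # back-propagate the top class score

# Grad-CAM weighting: channel weights = global-average-pooled gradients.
w = feats["grad"].mean(dim=(2, 3), keepdim=True)
attn = F.relu((w * feats["act"]).sum(dim=1, keepdim=True))
attn = F.interpolate(attn, size=x.shape[2:], mode="bilinear",
                     align_corners=False)
attn = (attn - attn.min()) / (attn.max() - attn.min() + 1e-8)  # to [0, 1]
```

In this sketch the attention map is a post-hoc read-out; the framework described above instead treats such maps as an explicit component of the training graph so that losses can be placed on them directly.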