有条件学习注意序列视觉任务 (Conditionally Learn to Pay Attention for Sequential Visual Task)

Sequential visual task usually requires to pay attention to its current interested object conditional on its previous observations. Different from popular soft attention mechanism, we propose a new attention framework by introducing a novel conditional global feature which represents the weak feature descriptor of the current focused object. Specifically, for a standard CNN (Convolutional Neural Network) pipeline, the convolutional layers with different receptive fields are used to produce the attention maps by measuring how the convolutional features align to the conditional global feature. The conditional global feature can be generated by different recurrent structure according to different visual tasks, such as a simple recurrent neural network for multiple objects recognition, or a moderate complex language model for image caption. Experiments show that our proposed conditional attention model achieves the best performance on the SVHN (Street View House Numbers) dataset with / without extra bounding box; and for image caption, our attention model generates better scores than the popular soft attention model.

翻译：连续视觉任务通常需要关注其以先前的观测为条件的当前相关对象。不同于大众软关注机制, 我们提出一个新的关注框架, 引入一个新的有条件的全球特征, 代表当前焦点对象的薄弱特征描述符。具体来说, 对于标准的CNN( 革命神经网络) 管道, 使用具有不同接收域的革命层来制作关注图, 测量共进特征如何与有条件的全球特征相匹配。有条件的全球特征可以由不同的常规结构根据不同的视觉任务生成, 如用于多对象识别的简单经常性神经网络, 或用于图像说明的中度复杂语言模型。实验显示, 我们提议的有条件关注模型在 SVHN( 街道浏览房屋数字) 数据集上取得了最佳的性能, 并且没有附加额外的框条框; 对于图像说明, 我们的注意模型比流行的软关注模型产生更好的分数。

相关内容

注意力机制

关注 120

Attention机制最早是在视觉图像领域提出来的，但是真正火起来应该算是google mind团队的这篇论文《Recurrent Models of Visual Attention》[14]，他们在RNN模型上使用了attention机制来进行图像分类。随后，Bahdanau等人在论文《Neural Machine Translation by Jointly Learning to Align and Translate》 [1]中，使用类似attention的机制在机器翻译任务上将翻译和对齐同时进行，他们的工作算是是第一个提出attention机制应用到NLP领域中。接着类似的基于attention机制的RNN模型扩展开始应用到各种NLP任务中。最近，如何在CNN中使用attention机制也成为了大家的研究热点。下图表示了attention研究进展的大概趋势。

【ICLR 2019】双曲注意力网络，Hyperbolic Attention Network

专知会员服务

84+阅读 · 2020年6月21日

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

专知会员服务

115+阅读 · 2020年4月5日

【快讯】CVPR2020结果出炉，1470篇上榜，你的paper中了吗？

专知会员服务

51+阅读 · 2020年2月24日

深度强化学习策略梯度教程，53页ppt

专知会员服务

184+阅读 · 2020年2月1日