Implicit and Explicit Attention for Zero-Shot Learning

Most existing Zero-Shot Learning (ZSL) methods focus on learning a compatibility function between the image representation and class attributes. A few others concentrate on learning an image representation that combines local and global features. However, existing approaches still fail to address the bias towards the seen classes. In this paper, we propose implicit and explicit attention mechanisms to address this bias in ZSL models. We formulate the implicit attention mechanism as a self-supervised image rotation-angle prediction task, which directs the model to the specific image features that help solve the task. The explicit attention mechanism is built on the multi-headed self-attention of a Vision Transformer model, which learns to map image features to the semantic space during training. We conduct comprehensive experiments on three popular benchmarks: AWA2, CUB and SUN. The proposed attention mechanisms prove effective and achieve state-of-the-art harmonic mean on all three datasets.
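As a rough illustration of two ideas named in the abstract, the sketch below builds the inputs for a rotation-prediction pretext task and computes the harmonic mean commonly reported in generalized ZSL. It is a toy under stated assumptions, not the authors' implementation: the four rotation angles (0, 90, 180, 270 degrees), the nested-list image format, and the helper names `rot90`, `make_rotation_batch`, and `harmonic_mean` are all hypothetical choices made for this example.

```python
from typing import List, Tuple

# Toy grayscale image: a list of rows of pixel values (assumption for this sketch).
Image = List[List[float]]


def rot90(image: Image) -> Image:
    """Rotate a 2D grid by 90 degrees counter-clockwise."""
    # Transposing and then reversing the row order gives a counter-clockwise turn.
    return [list(row) for row in zip(*image)][::-1]


def make_rotation_batch(image: Image) -> List[Tuple[Image, int]]:
    """Return four rotated copies of `image`, each paired with a rotation label.

    Label k means the image was rotated by k * 90 degrees. A classifier
    trained to predict k receives supervision without any class labels,
    which is the self-supervised signal the abstract refers to.
    """
    batch = []
    current = [list(row) for row in image]  # copy so the input is not mutated
    for label in range(4):
        batch.append((current, label))
        current = rot90(current)
    return batch


def harmonic_mean(seen_acc: float, unseen_acc: float) -> float:
    """Harmonic mean of seen-class and unseen-class accuracy.

    H = 2 * S * U / (S + U). It is high only when both accuracies are high,
    so a model biased towards seen classes is penalised.
    """
    if seen_acc + unseen_acc == 0:
        return 0.0
    return 2 * seen_acc * unseen_acc / (seen_acc + unseen_acc)


if __name__ == "__main__":
    for rotated, label in make_rotation_batch([[1, 2], [3, 4]]):
        print(label, rotated)
    print(harmonic_mean(0.8, 0.4))
```

The harmonic mean makes the bias problem visible: a model with 0.8 seen-class accuracy but 0.4 unseen-class accuracy scores about 0.53, well below the arithmetic mean of 0.6.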


Related content

Attention mechanism


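The attention mechanism was first proposed in the field of computer vision, but it really took off with the Google DeepMind paper "Recurrent Models of Visual Attention" [14], which applied attention to an RNN model for image classification. Subsequently, Bahdanau et al., in "Neural Machine Translation by Jointly Learning to Align and Translate" [1], used an attention-like mechanism to perform translation and alignment jointly on a machine translation task; their work is generally regarded as the first application of attention to NLP. Attention-based extensions of RNN models were then applied to a wide range of NLP tasks. More recently, how to use attention in CNNs has also become a popular research topic. The figure below shows the rough trend of attention research.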
