化妆216:以反对立关注代表形式确认圆形标志 (Makeup216: Logo Recognition with Adversarial Attention Representations)

One of the challenges of logo recognition lies in the diversity of forms, such as symbols, texts or a combination of both; further, logos tend to be extremely concise in design while similar in appearance, suggesting the difficulty of learning discriminative representations. To investigate the variety and representation of logo, we introduced Makeup216, the largest and most complex logo dataset in the field of makeup, captured from the real world. It comprises of 216 logos and 157 brands, including 10,019 images and 37,018 annotated logo objects. In addition, we found that the marginal background around the pure logo can provide a important context information and proposed an adversarial attention representation framework (AAR) to attend on the logo subject and auxiliary marginal background separately, which can be combined for better representation. Our proposed framework achieved competitive results on Makeup216 and another large-scale open logo dataset, which could provide fresh thinking for logo recognition. The dataset of Makeup216 and the code of the proposed framework will be released soon.

翻译：标识识别的挑战之一在于各种形式的多样性,例如符号、文本或两者的结合;此外,标识在设计上往往非常简洁,但外观相似,表明难以学习歧视性表述,为调查标识的种类和代表性,我们引入了化妆216,这是化妆领域最大和最复杂的标识数据集,从真实世界中采集,包括216个标识和157个品牌,包括10 019个图像和37 018个附加说明的标识对象。此外,我们发现,纯标识周围的边缘背景可以提供重要的背景信息,并提议一个可单独参加标识主题和辅助边际背景的对立关注代表框架(AAR),这一框架可以合并,以提高代表性。我们提议的框架在化妆216和另一个大规模开放标识数据集方面取得了竞争性结果,可为标识识别提供新的思维。Makeup216数据集和拟议框架的代码将很快发布。

相关内容

注意力机制

关注 120

Attention机制最早是在视觉图像领域提出来的，但是真正火起来应该算是google mind团队的这篇论文《Recurrent Models of Visual Attention》[14]，他们在RNN模型上使用了attention机制来进行图像分类。随后，Bahdanau等人在论文《Neural Machine Translation by Jointly Learning to Align and Translate》 [1]中，使用类似attention的机制在机器翻译任务上将翻译和对齐同时进行，他们的工作算是是第一个提出attention机制应用到NLP领域中。接着类似的基于attention机制的RNN模型扩展开始应用到各种NLP任务中。最近，如何在CNN中使用attention机制也成为了大家的研究热点。下图表示了attention研究进展的大概趋势。

【上海交大】<操作系统> 2021课程，附课件

专知会员服务

42+阅读 · 2021年4月3日

【ETH】最新《几何数据分析》2020课程，附PPT下载

专知会员服务

45+阅读 · 2020年12月18日

【Google】平滑对抗训练，Smooth Adversarial Training

专知会员服务

49+阅读 · 2020年7月4日

零样本文本分类，Zero-Shot Learning for Text Classification

专知会员服务

97+阅读 · 2020年5月31日