One of the challenges of logo recognition lies in the diversity of forms, such as symbols, texts or a combination of both; further, logos tend to be extremely concise in design while similar in appearance, suggesting the difficulty of learning discriminative representations. To investigate the variety and representation of logo, we introduced Makeup216, the largest and most complex logo dataset in the field of makeup, captured from the real world. It comprises of 216 logos and 157 brands, including 10,019 images and 37,018 annotated logo objects. In addition, we found that the marginal background around the pure logo can provide a important context information and proposed an adversarial attention representation framework (AAR) to attend on the logo subject and auxiliary marginal background separately, which can be combined for better representation. Our proposed framework achieved competitive results on Makeup216 and another large-scale open logo dataset, which could provide fresh thinking for logo recognition. The dataset of Makeup216 and the code of the proposed framework will be released soon.
翻译:标识识别的挑战之一在于各种形式的多样性,例如符号、文本或两者的结合;此外,标识在设计上往往非常简洁,但外观相似,表明难以学习歧视性表述,为调查标识的种类和代表性,我们引入了化妆216,这是化妆领域最大和最复杂的标识数据集,从真实世界中采集,包括216个标识和157个品牌,包括10 019个图像和37 018个附加说明的标识对象。此外,我们发现,纯标识周围的边缘背景可以提供重要的背景信息,并提议一个可单独参加标识主题和辅助边际背景的对立关注代表框架(AAR),这一框架可以合并,以提高代表性。我们提议的框架在化妆216和另一个大规模开放标识数据集方面取得了竞争性结果,可为标识识别提供新的思维。Makeup216数据集和拟议框架的代码将很快发布。