Recently, logo detection has received more and more attention for its wide applications in the multimedia field, such as intellectual property protection, product brand management, and logo duration monitoring. Unlike general object detection, logo detection is a challenging task, especially for small logo objects and large aspect ratio logo objects in the real-world scenario. In this paper, we propose a novel approach, named Discriminative Semantic Feature Pyramid Network with Guided Anchoring (DSFP-GA), which can address these challenges via aggregating the semantic information and generating different aspect ratio anchor boxes. More specifically, our approach mainly consists of Discriminative Semantic Feature Pyramid (DSFP) and Guided Anchoring (GA). Considering that low-level feature maps that are used to detect small logo objects lack semantic information, we propose the DSFP, which can enrich more discriminative semantic features of low-level feature maps and can achieve better performance on small logo objects. Furthermore, preset anchor boxes are less efficient for detecting large aspect ratio logo objects. We therefore integrate the GA into our method to generate large aspect ratio anchor boxes to mitigate this issue. Extensive experimental results on four benchmarks demonstrate the effectiveness of our proposed DSFP-GA. Moreover, we further conduct visual analysis and ablation studies to illustrate the advantage of our method in detecting small and large aspect logo objects. The code and models can be found at https://github.com/Zhangbaisong/DSFP-GA.
翻译:最近,标识探测在多媒体领域的广泛应用,如知识产权保护、产品品牌管理和标识持续时间监测等,得到了越来越多的关注。与一般目标探测不同,标识探测是一项艰巨的任务,特别是对于在现实世界情景中小型标识物体和大端比标识物体而言。在本文中,我们提出一种新颖的方法,名为 " 带有制导动图解析图的词性图性图像网 " (DSFP-GA),它能够通过汇总语义信息并生成不同方位定位标记框来应对这些挑战。更具体地说,我们的方法主要包括分辨语义特异功能图和导导Anchoring(GA)。考虑到用于探测小标识物体的低端特征图象图缺乏语义信息,我们建议DSFP,它能够丰富低级特征图谱图的更具歧视性的语义特征,并能在小型标识对象上取得更好的性能。此外,预设锚定位框对于探测大方位标值目标的效率较低。我们因此将GA方法纳入我们的大端定位定位定位定位图象定位框和图像定位框的优势。我们为DGA-GA-GA分析的大规模分析。