In this paper, we introduce a new dataset of room interior pictures with overlaying and scene text, totalling to 4836 annotated images in 25 product categories. We provide details on the collection and annotation process of our dataset, and analyze its statistics. Furthermore, we propose a baseline method for overlaying text detection, that leverages the character region-aware text detection framework to guide the classification model. We validate our approach and show its efficiency in terms of binary classification metrics, reaching the final performance of 0.95 F1 score, with false positive and false negative rates of 0.02 and 0.06 correspondingly.
翻译:在本文中,我们引入了一套新的室内室内照片数据集,内含重叠和场景文本,总共在25个产品类别中提供了4836张附加说明的图像,我们详细介绍了我们数据集的收集和注释过程,并分析了其统计数据。此外,我们提出了一种覆盖文本检测基线方法,利用字符区域识别文本检测框架来指导分类模式。我们验证了我们的方法,并显示了其二元分类指标的效率,达到了0.95 F1分的最后性能,正值和负值分别为0.02和0.06,相应的负值为0.02和0.06。