为何要为现场文字识别尝试真实数据 (Why You Should Try the Real Data for the Scene Text Recognition)

Recent works in the text recognition area have pushed forward the recognition results to the new horizons. But for a long time a lack of large human-labeled natural text recognition datasets has been forcing researchers to use synthetic data for training text recognition models. Even though synthetic datasets are very large (MJSynth and SynthTest, two most famous synthetic datasets, have several million images each), their diversity could be insufficient, compared to natural datasets like ICDAR and others. Fortunately, the recently released text-recognition annotation for OpenImages V5 dataset has comparable with synthetic dataset number of instances and more diverse examples. We have used this annotation with a Text Recognition head architecture from the Yet Another Mask Text Spotter and got comparable to the SOTA results. On some datasets we have even outperformed previous SOTA models. In this paper we also introduce a text recognition model. The model's code is available.

翻译：文本识别领域的近期工作将识别结果推向了新视野。但长期以来,缺少大量人类标签的自然文本识别数据集迫使研究人员将合成数据用于培训文本识别模型。即使合成数据集非常庞大(MJSynth和SynthTest,两个最著名的合成数据集,每个数据集都有几百万张图像),与ICDAR等自然数据集相比,它们的多样性可能不够。幸运的是,最近发布的OpenImages V5数据集的文本识别注释与合成数据集实例和更多样的例子数量相仿。我们使用该批注与“又一个面具文本显示器”的文本识别头结构相仿,并与SOTA结果相仿。在有些数据集上,我们甚至比以往的SOTA模型还差。在本文中,我们还引入了一个文本识别模型。该模型的代码是可用的。

相关内容

数据集

关注 88

数据集，又称为资料集、数据集合或资料集合，是一种由数据所组成的集合。
Data set（或dataset）是一个数据的集合，通常以表格形式出现。每一列代表一个特定变量。每一行都对应于某一成员的数据集的问题。它列出的价值观为每一个变量，如身高和体重的一个物体或价值的随机数。每个数值被称为数据资料。对应于行数，该数据集的数据可能包括一个或多个成员。

基于深度学习的事件因果关系抽取综述

专知会员服务

80+阅读 · 2021年5月27日

【文献综述】Text Detection and Recognition in the Wild: A Review 自然文本检测与识别

专知会员服务

46+阅读 · 2020年6月11日

语言视觉预训练语言模型揭密，Behind the Scene: Revealing the Secrets of Pre-trained Vision-and-Language Models

专知会员服务

36+阅读 · 2020年5月20日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日