License plate recognition plays a critical role in many practical applications, but license plates of large vehicles are difficult to be recognized due to the factors of low resolution, contamination, low illumination, and occlusion, to name a few. To overcome the above factors, the transportation management department generally introduces the enlarged license plate behind the rear of a vehicle. However, enlarged license plates have high diversity as they are non-standard in position, size, and style. Furthermore, the background regions contain a variety of noisy information which greatly disturbs the recognition of license plate characters. Existing works have not studied this challenging problem. In this work, we first address the enlarged license plate recognition problem and contribute a dataset containing 9342 images, which cover most of the challenges of real scenes. However, the created data are still insufficient to train deep methods of enlarged license plate recognition, and building large-scale training data is very time-consuming and high labor cost. To handle this problem, we propose a novel task-level disentanglement generation framework based on the Disentangled Generation Network (DGNet), which disentangles the generation into the text generation and background generation in an end-to-end manner to effectively ensure diversity and integrity, for robust enlarged license plate recognition. Extensive experiments on the created dataset are conducted, and we demonstrate the effectiveness of the proposed approach in three representative text recognition frameworks.
翻译:在许多实际应用中,牌照识别在许多实际应用中发挥着关键作用,但大型车辆的牌照牌照由于分辨率低、污染、照明度低和封闭度低等因素而难以被识别。为了克服上述因素,运输管理部通常在车辆后面采用扩大的牌照。不过,扩大的牌照具有高度多样性,因为其位置、大小和风格不标准。此外,背景区域包含各种吵闹的信息,大大扰乱了对牌照字符的识别。现有工作尚未研究这一具有挑战性的问题。在这项工作中,我们首先处理扩大的牌照识别问题,并提供包含9342图象的数据集,其中涵盖了真实场景的大多数挑战。然而,所创造的数据仍不足以在车辆后方培养出深度的牌照识别方法,而建立大型培训数据则非常耗时费和高。为了处理这一问题,我们提议以分解的新一代网络(DGNet)为基础,建立一个新的任务级混乱生成框架,将产品生成与文本生成和背景生成相混淆起来,从而覆盖了9342图象的数据集,其中涵盖了真实场景的多数挑战。但是,所创造的数据仍能有效地展示了我们所创建的多样化和广泛认识。