利用基于投标的管道对附加说明的图像和视频数据进行合成,改进许可证板的识别 (Synthesizing Annotated Image and Video Data Using a Rendering-Based Pipeline for Improved License Plate Recognition)

An insufficient number of training samples is a common problem in neural network applications. While data augmentation methods require at least a minimum number of samples, we propose a novel, rendering-based pipeline for synthesizing annotated data sets. Our method does not modify existing samples but synthesizes entirely new samples. The proposed rendering-based pipeline is capable of generating and annotating synthetic and partly-real image and video data in a fully automatic procedure. Moreover, the pipeline can aid the acquisition of real data. The proposed pipeline is based on a rendering process. This process generates synthetic data. Partly-real data bring the synthetic sequences closer to reality by incorporating real cameras during the acquisition process. The benefits of the proposed data generation pipeline, especially for machine learning scenarios with limited available training data, are demonstrated by an extensive experimental validation in the context of automatic license plate recognition. The experiments demonstrate a significant reduction of the character error rate and miss rate from 73.74% and 100% to 14.11% and 41.27% respectively, compared to an OCR algorithm trained on a real data set solely. These improvements are achieved by training the algorithm on synthesized data solely. When additionally incorporating real data, the error rates can be decreased further. Thereby, the character error rate and miss rate can be reduced to 11.90% and 39.88% respectively. All data used during the experiments as well as the proposed rendering-based pipeline for the automated data generation is made publicly available under (URL will be revealed upon publication).

翻译：在神经网络应用中,培训样本数量不足是一个常见的问题。虽然数据增强方法至少需要最低数量的样本,但我们建议采用新的、基于基础的管道,以合成附加说明的数据集。我们的方法并不修改现有的样本,而是合成全新的样本。拟议的基于铺面的管道能够在完全自动的程序中生成和批注合成和部分真实的图像和视频数据。此外,管道可以帮助获取真实数据。拟议管道基于一个传输过程。该流程将生成合成数据。部分真实数据使合成序列更加接近现实,在采购过程中纳入真实的相机。拟议的数据生成管道,特别是用于现有培训数据有限的机器学习情景,其好处通过在自动牌照识别过程中进行广泛的实验验证而得到证明。实验表明,与仅通过在真实数据集上培训的OCR算法相比,性质错误率分别从73.74%和100 %降至14.11%和41.27%。这些改进是通过仅对合成数据进行整合的算法进行进一步的培训而实现的。在获取过程中,仅对合成数据进行第11.88号中,在进一步纳入真实数据时,将使用所有生成率时,将降低数据的比例。