Indian vehicle number plates have wide variety in terms of size, font, script and shape. Development of Automatic Number Plate Recognition (ANPR) solutions is therefore challenging, necessitating a diverse dataset to serve as a collection of examples. However, a comprehensive dataset of Indian scenario is missing, thereby, hampering the progress towards publicly available and reproducible ANPR solutions. Many countries have invested efforts to develop comprehensive ANPR datasets like Chinese City Parking Dataset (CCPD) for China and Application-oriented License Plate (AOLP) dataset for US. In this work, we release an expanding dataset presently consisting of 1.5k images and a scalable and reproducible procedure of enhancing this dataset towards development of ANPR solution for Indian conditions. We have leveraged this dataset to explore an End-to-End (E2E) ANPR architecture for Indian scenario which was originally proposed for Chinese Vehicle number-plate recognition based on the CCPD dataset. As we customized the architecture for our dataset, we came across insights, which we have discussed in this paper. We report the hindrances in direct reusability of the model provided by the authors of CCPD because of the extreme diversity in Indian number plates and differences in distribution with respect to the CCPD dataset. An improvement of 42.86% was observed in LP detection after aligning the characteristics of Indian dataset with Chinese dataset. In this work, we have also compared the performance of the E2E number-plate detection model with YOLOv5 model, pre-trained on COCO dataset and fine-tuned on Indian vehicle images. Given that the number Indian vehicle images used for fine-tuning the detection module and yolov5 were same, we concluded that it is more sample efficient to develop an ANPR solution for Indian conditions based on COCO dataset rather than CCPD dataset.
翻译:印度车辆牌照在大小、字体、脚本和形状方面差异很大。 因此,自动数字板识别(ANPR)解决方案的开发具有挑战性,因此,我们发布一个扩大的数据集,目前由1.5k图像组成,并需要一套可缩放和可复制的数据集作为实例收集。 然而,印度情况的综合数据集缺失,从而阻碍了在公开提供和可复制的ANPR解决方案方面取得进展。许多国家已经投入努力开发全面的ANPR数据集,如中国的中国城市泊车数据集(CCPD)和面向应用程序的美国许可证板数据集。在这项工作中,我们发布了一个由1.5k图像组成的扩大数据集,以及一个可缩放和可复制的程序,加强这一数据集,以用于开发印度条件的ANPR解决方案解决方案解决方案解决方案。我们利用这一数据集来探索“End-End”(E2E)和可复制的印度情况。