We propose an automatic framework for toll collection, consisting of three steps: vehicle type recognition, license plate localization, and reading. However, each of the three steps becomes non-trivial due to image variations caused by several factors. The traditional vehicle decorations on the front cause variations among vehicles of the same type. These decorations make license plate localization and recognition difficult due to severe background clutter and partial occlusions. Likewise, on most vehicles, specifically trucks, the position of the license plate is not consistent. Lastly, for license plate reading, the variations are induced by non-uniform font styles, sizes, and partially occluded letters and numbers. Our proposed framework takes advantage of both data availability and performance evaluation of the backbone deep learning architectures. We gather a novel dataset, \emph{Diverse Vehicle and License Plates Dataset (DVLPD)}, consisting of 10k images belonging to six vehicle types. Each image is then manually annotated for vehicle type, license plate, and its characters and digits. For each of the three tasks, we evaluate You Only Look Once (YOLO)v2, YOLOv3, YOLOv4, and FasterRCNN. For real-time implementation on a Raspberry Pi, we evaluate the lighter versions of YOLO named Tiny YOLOv3 and Tiny YOLOv4. The best Mean Average Precision (mAP@0.5) of 98.8% for vehicle type recognition, 98.5% for license plate detection, and 98.3% for license plate reading is achieved by YOLOv4, while its lighter version, i.e., Tiny YOLOv4 obtained a mAP of 97.1%, 97.4%, and 93.7% on vehicle type recognition, license plate detection, and license plate reading, respectively. The dataset and the training codes are available at https://github.com/usama-x930/VT-LPR
翻译:我们建议一个自动的收费收集框架,由三步组成:车辆类型识别、车牌本地化和阅读。然而,由于若干因素造成的图像变异,这三步中的每一个都变得非三角性。前方的传统车辆装饰导致同一类型车辆的变异。这些装饰使得牌照本地化和识别困难,因为背景严重模糊和部分隔绝。同样,在大多数车辆,特别是卡车上,牌照的位置不一致。最后,车牌阅读,这些变异是由非统一字体样式、尺寸和部分隐蔽字母和数字引起的。我们提议的框架利用了骨干深层学习结构的数据提供和性能评价。我们收集了一个新的数据集,\emph{Dirversive 车辆和牌照数据集(DVLLD)由属于六种车型的10k图像组成。每张图随后手动了车辆类型、牌照、牌照、品型号和数字。对于这三项任务,我们只看一次(YOL OrightO) Rir-O 版本的98O;YvOL 版本的 Real-O;YvOL 版本的 Real-O;YvOL 版本的 Real-O;YvO) 版本的 Real-O 版本的 Rev-O;Yv-O 版本的 Rev-deal-deal-O 版本, 版本的 Rev-dent;Yv-O;