Coronavirus 2019 has brought severe challenges to social stability and public health worldwide. One effective way of curbing the epidemic is to require people to wear masks in public places and monitor mask-wearing states by utilizing suitable automatic detectors. However, existing deep learning based models struggle to simultaneously achieve the requirements of both high precision and real-time performance. To solve this problem, we propose an improved lightweight face mask detector based on YOLOv5, which can achieve an excellent balance of precision and speed. Firstly, a novel backbone ShuffleCANet that combines ShuffleNetV2 network with Coordinate Attention mechanism is proposed as the backbone. Afterwards, an efficient path aggression network BiFPN is applied as the feature fusion neck. Furthermore, the localization loss is replaced with alpha-CIoU in model training phase to obtain higher-quality anchors. Some valuable strategies such as data augmentation, adaptive image scaling, and anchor cluster operation are also utilized. Experimental results on AIZOO face mask dataset show the superiority of the proposed model. Compared with the original YOLOv5, the proposed model increases the inference speed by 28.3% while still improving the precision by 0.58%. It achieves the best mean average precision of 95.2% compared with other seven existing models, which is 4.4% higher than the baseline.
翻译:2019年科罗纳病毒给世界社会稳定和公共卫生带来了严峻的挑战。遏制这一流行病的有效方法之一是要求人们在公共场所戴面罩,并通过使用合适的自动探测器监测戴面罩的国家。然而,现有的深层次基于学习的模型努力同时达到高精度和实时性能的要求。为了解决这个问题,我们提议以YOLOv5为基础改进轻量面罩探测器,这可以实现极佳的精确和速度平衡。首先,将ShuffleNetV2网络与协调关注机制相结合的新型骨干ShuffleCanet作为主干线。随后,将高效路径入侵网络BiFPN用作特征聚变颈部。此外,在模型培训阶段,当地化损失被用阿尔法-CIoU取代,以获得更高质量的锚。还利用了一些有价值的战略,如数据增强、适应性图像缩放和锚定集操作等。AIZO 面罩数据集的实验结果显示拟议模型的优越性。与原始YOLOv5号相比,拟议模型的推导速度提高了28.3%,而目前的平均精确度则提高了95.2%,而现在的精确度则提高了0.4.4%。