The number of traffic accidents has been continuously increasing in recent years worldwide. Many accidents are caused by distracted drivers, who take their attention away from driving. Motivated by the success of Convolutional Neural Networks (CNNs) in computer vision, many researchers developed CNN-based algorithms to recognize distracted driving from a dashcam and warn the driver against unsafe behaviors. However, current models have too many parameters, which is unfeasible for vehicle-mounted computing. This work proposes a novel knowledge-distillation-based framework to solve this problem. The proposed framework first constructs a high-performance teacher network by progressively strengthening the robustness to illumination changes from shallow to deep layers of a CNN. Then, the teacher network is used to guide the architecture searching process of a student network through knowledge distillation. After that, we use the teacher network again to transfer knowledge to the student network by knowledge distillation. Experimental results on the Statefarm Distracted Driver Detection Dataset and AUC Distracted Driver Dataset show that the proposed approach is highly effective for recognizing distracted driving behaviors from photos: (1) the teacher network's accuracy surpasses the previous best accuracy; (2) the student network achieves very high accuracy with only 0.42M parameters (around 55% of the previous most lightweight model). Furthermore, the student network architecture can be extended to a spatial-temporal 3D CNN for recognizing distracted driving from video clips. The 3D student network largely surpasses the previous best accuracy with only 2.03M parameters on the Drive&Act Dataset. The source code is available at https://github.com/Dichao-Liu/Lightweight_Distracted_Driver_Recognition_with_Distillation-Based_NAS_and_Knowledge_Transfer.
翻译:近些年来,世界各地交通事故的数量持续增加。许多事故都是由分散注意力的驱动者引发的,他们转移了对驾驶的注意力。在计算机视觉中,受Culual Neal网络(CNN)的成功激励,许多研究人员开发了CNN的算法,以识别从破碎摄像头驱动的分心,并警告驱动者避免不安全的行为。然而,目前的模型有太多参数,对于车辆上载计算来说是行不通的。这项工作提出了一个新的基于知识蒸馏的框架,以在很大程度上解决该问题。拟议框架首先通过逐步加强神经神经神经网络(CNN)从浅层到深层对光亮度变化的精度,构建了一个高性教师网络。之后,我们再次使用教师网络,通过知识蒸馏将知识传输到学生网络。 国家农场中吸引的驱动力驱动器检测数据元和AUCUC 蒸汽驱动力驱动力数据集显示,拟议的方法对于从照片中识别分心驱动力驱动力驱动力动作的精度的精度测试网络,Sentreal_rental salal_D