In this paper, an LSTM autoencoder-based architecture is utilized for drowsiness detection with ResNet-34 as feature extractor. The problem is considered as anomaly detection for a single subject; therefore, only the normal driving representations are learned and it is expected that drowsiness representations, yielding higher reconstruction losses, are to be distinguished according to the knowledge of the network. In our study, the confidence levels of normal and anomaly clips are investigated through the methodology of label assignment such that training performance of LSTM autoencoder and interpretation of anomalies encountered during testing are analyzed under varying confidence rates. Our method is experimented on NTHU-DDD and benchmarked with a state-of-the-art anomaly detection method for driver drowsiness. Results show that the proposed model achieves detection rate of 0.8740 area under curve (AUC) and is able to provide significant improvements on certain scenarios.
翻译:本文使用基于LSTM自动编码器的建筑进行沉睡检测,以ResNet-34作为特征提取器,问题被视为单个对象的异常现象检测;因此,只学习正常的驾驶说明,预计根据网络知识对沉睡表现进行区分,从而造成较高的重建损失;在我们的研究中,通过标签分配方法调查正常和异常片段的可信度水平,以便用不同的信任率分析LSTM自动编码器的培训性能和对测试期间遇到的异常现象的解释;我们的方法在NTHU-DDD上进行试验,并以最先进的异常探测方法为基准来测定驱动器的漂浮情况;结果显示,拟议的模型在曲线下达到0.8740的探测率,并能在某些情景上提供显著改进。