As the central nerve of the intelligent vehicle control system, the in-vehicle network bus is crucial to the security of vehicle driving. One of the best standards for the in-vehicle network is the Controller Area Network (CAN bus) protocol. However, the CAN bus is designed to be vulnerable to various attacks due to its lack of security mechanisms. To enhance the security of in-vehicle networks and promote the research in this area, based upon a large scale of CAN network traffic data with the extracted valuable features, this study comprehensively compared fully-supervised machine learning with semi-supervised machine learning methods for CAN message anomaly detection. Both traditional machine learning models (including single classifier and ensemble models) and neural network based deep learning models are evaluated. Furthermore, this study proposed a deep autoencoder based semi-supervised learning method applied for CAN message anomaly detection and verified its superiority over other semi-supervised methods. Extensive experiments show that the fully-supervised methods generally outperform semi-supervised ones as they are using more information as inputs. Typically the developed XGBoost based model obtained state-of-the-art performance with the best accuracy (98.65%), precision (0.9853), and ROC AUC (0.9585) beating other methods reported in the literature.
翻译:作为智能车辆控制系统的核心神经,车辆内网络公交系统对于车辆驾驶的安全至关重要。车辆内网络公交系统的最佳标准之一是控制区网络(CAN)协议。然而,CAN公交系统的设计因缺乏安全机制而易受各种攻击。为了加强车辆内网络的安全,促进这一领域的研究,根据大规模CAN网络数据及其提取的宝贵特征,本研究报告全面比较了完全监督的机器学习和半监督的机器学习方法,以便探测CAN信息异常现象。对传统机器学习模式(包括单一分类和聚合模型)和基于深层学习模型的神经网络都进行了评估。此外,这项研究还提出了一种基于深度自动校准的半监督学习方法,用于CAN信息异常探测并核实其优于其他半监督方法。广泛的实验表明,完全监督的方法一般比半监督的机器学习方法要强,因为它们正在使用更多的信息。典型的情况是,发达的XGBoost模型(包括单一分类和聚合模型)和基于深层学习模型的神经网络模式。此外,本研究提出了一种基于深层自动编码的半监督学习方法,用以探测和校正(98)报告最佳精确度(O-65)其他精确度。