Intrusion Detection Systems (IDS) are critical security mechanisms that protect against a wide variety of network threats and malicious behaviors on networks or hosts. As both Network-based IDS (NIDS) or Host-based IDS (HIDS) have been widely investigated, this paper aims to present a Combined Intrusion Detection System (CIDS) that integrates network and host data in order to improve IDS performance. Due to the scarcity of datasets that include both network packet and host data, we present a novel CIDS dataset formation framework that can handle log files from a variety of operating systems and align log entities with network flows. A new CIDS dataset named SCVIC-CIDS-2021 is derived from the meta-data from the well-known benchmark dataset, CIC-IDS-2018 by utilizing the proposed framework. Furthermore, a transformer-based deep learning model named CIDS-Net is proposed that can take network flow and host features as inputs and outperform baseline models that rely on network flow features only. Experimental results to evaluate the proposed CIDS-Net under the SCVIC-CIDS-2021 dataset support the hypothesis for the benefits of combining host and flow features as the proposed CIDS-Net can improve the macro F1 score of baseline solutions by 6.36% (up to 99.89%).
翻译:入侵探测系统(入侵探测系统)是防范网络或主机上各种网络威胁和恶意行为的关键安全机制。由于对基于网络的ISDS(NIDS)或主机的ISDS(HIDS)进行了广泛调查,本文件的目的是提出一个综合网络和主机数据的入侵探测综合系统(CIDS),以改善ISDS的性能。由于缺少包括网络包和主机数据的数据集,我们提出了一个新的CIDS数据集形成框架,它能够处理各种操作系统的日志文件,并使日志实体与网络流接轨。一个新的CIDS数据集名为SCVIC-CIDES-2021,它是从众所周知的基准数据集CIC-IDS-2018的元数据中衍生出来的。此外,还提议采用一个基于变压器的深度学习模型(CIDS-Net),它可以将网络流和主机特征作为投入和仅依赖网络流特征的超出常规基线模型。实验结果,用以评价在SCVIC-CIS-2021中拟议的CIDS-CS-2021数据库网络网络的日志。一个新的数据集集数据集来自著名的基准数据集集,通过提议的6.89MISDS-cregresmalet 的模型,可以改进提议的流的模型的模型的收益。