Network Intrusion Detection Systems (NIDSs) are important tools for the protection of computer networks against increasingly frequent and sophisticated cyber attacks. Recently, a lot of research effort has been dedicated to the development of Machine Learning (ML) based NIDSs. As in any ML-based application, the availability of high-quality datasets is critical for the training and evaluation of ML-based NIDS. One of the key problems with the currently available datasets is the lack of a standard feature set. The use of a unique and proprietary set of features for each of the publicly available datasets makes it virtually impossible to compare the performance of ML-based traffic classifiers on different datasets, and hence to evaluate the ability of these systems to generalise across different network scenarios. To address that limitation, this paper proposes and evaluates standard NIDS feature sets based on the NetFlow network meta-data collection protocol and system. We evaluate and compare two NetFlow-based feature set variants, a version with 12 features, and another one with 43 features.
翻译:网络入侵探测系统(NIDS)是保护计算机网络免遭日益频繁和复杂的网络攻击的重要工具。最近,许多研究工作都致力于开发基于机器学习(ML)的NIDS。与任何基于ML的应用一样,高质量数据集的提供对于培训和评价基于ML的NIDS至关重要。现有数据集的主要问题之一是缺乏一套标准特征。对每套公开数据集使用一套独特和专有的功能,使得几乎无法比较基于ML的交通分类器在不同数据集上的性能,从而无法评价这些系统在跨不同网络情景上的一般化能力。为解决这一问题,本文件提议和评价基于NetFlow网络元数据收集协议和系统的NIDS标准特征数据集。我们评价和比较了两个基于NetFlow的功能集变异,一个版本有12个特征,另一个版本有43个特征。