The performance of machine learning based network intrusion detection systems (NIDSs) severely degrades when deployed on a network with significantly different feature distributions from the ones of the training dataset. In various applications, such as computer vision, domain adaptation techniques have been successful in mitigating the gap between the distributions of the training and test data. In the case of network intrusion detection however, the state-of-the-art domain adaptation approaches have had limited success. According to recent studies, as well as our own results, the performance of an NIDS considerably deteriorates when the `unseen' test dataset does not follow the training dataset distribution. In some cases, swapping the train and test datasets makes this even more severe. In order to enhance the generalisibility of machine learning based network intrusion detection systems, we propose to extract domain invariant features using adversarial domain adaptation from multiple network domains, and then apply an unsupervised technique for recognising abnormalities, i.e., intrusions. More specifically, we train a domain adversarial neural network on labelled source domains, extract the domain invariant features, and train a One-Class SVM (OSVM) model to detect anomalies. At test time, we feedforward the unlabeled test data to the feature extractor network to project it into a domain invariant space, and then apply OSVM on the extracted features to achieve our final goal of detecting intrusions. Our extensive experiments on the NIDS benchmark datasets of NFv2-CIC-2018 and NFv2-UNSW-NB15 show that our proposed setup demonstrates superior cross-domain performance in comparison to the previous approaches.
翻译:机器学习基于网络入侵探测系统(NIDS)的性能如果在一个与培训数据集的分布功能大相径庭的网络上部署“看不见”测试数据集,其性能就会严重下降。在计算机视觉等各种应用中,域适应技术成功地缩小了培训和测试数据分布之间的差距。然而,在网络入侵探测中,最先进的域适应方法取得了有限的成功。根据最近的研究以及我们自己的结果,如果“不见”测试数据集不跟随培训数据集的分布,则该数据库的性能会大大下降。在某些情况下,对火车和测试数据集的交换使这一变化更加严重。为了提高机器学习基于网络入侵探测系统的通用性能,我们提议利用对抗域适应多网络域域的先进性能,然后采用一种不严密的识别异常技术,即,拟议的入侵。更具体地说,我们在一个域域域域域网内,将我们S-C的内基域域域域域域域域域域域域域域域域网的变换成一个S-CLVM 测试模型,然后将我们S-LIS的内域域域域域域域域域域域域域域域域域域域域域域域域域域域域域域域域图测试测试显示S-S-SVS-ServeM的测试测试项目显示S-S-S-SV的S-SlBSVS-S-S-S-Serveg-SVS-S-S-Serg-S-S-Serg-Serg-Stor 测试模型,然后在SVSDSDSDM 测试项目,在Smodrog-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-SV-SDV-SV-SV-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S