The rise of the new generation of cyber threats demands more sophisticated and intelligent cyber defense solutions equipped with autonomous agents capable of learning to make decisions without the knowledge of human experts. Several reinforcement learning methods (e.g., Markov) for automated network intrusion tasks have been proposed in recent years. In this paper, we introduce a new generation of network intrusion detection methods that combines a Q-learning-based reinforcement learning with a deep-feed forward neural network method for network intrusion detection. Our proposed Deep Q-Learning (DQL) model provides an ongoing auto-learning capability for a network environment that can detect different types of network intrusions using an automated trial-error approach and continuously enhance its detection capabilities. We provide the details of fine-tuning different hyperparameters involved in the DQL model for more effective self-learning. According to our extensive experimental results based on the NSL-KDD dataset, we confirm that the lower discount factor which is set as 0.001 under 250 episodes of training yields the best performance results. Our experimental results also show that our proposed DQL is highly effective in detecting different intrusion classes and outperforms other similar machine learning approaches.
翻译:新一代网络威胁的兴起要求更精密、更智能的网络防御解决方案,这些解决方案应配备能够自主地在人类专家不知情的情况下学习决策的自主代理器。近年来,为自动网络入侵任务提出了几种强化学习方法(如Markov)。在本文件中,我们引入了新一代网络入侵探测方法,将基于Q的学习强化学习与深网入侵探测的远方神经网络方法相结合。我们提议的深Q学习(DQL)模型为网络环境提供了一个持续的自动学习能力,这种网络环境能够使用自动试镜方法探测不同类型的网络入侵,并不断加强其探测能力。我们提供了DQL模型中涉及不同超参数的微调细节,以便更有效地自我学习。根据我们基于NSL-KDD数据集的广泛实验结果,我们确认,低于250个培训周期的0.001的低折扣系数产生了最佳的绩效结果。我们的实验结果还表明,我们拟议的DQL在探测不同入侵舱和超越其他类似机器学习方法方面非常有效。