With the growing use of information technology in all life domains, hacking has become more negatively effective than ever before. Also with developing technologies, attacks numbers are growing exponentially every few months and become more sophisticated so that traditional IDS becomes inefficient detecting them. This paper proposes a solution to detect not only new threats with higher detection rate and lower false positive than already used IDS, but also it could detect collective and contextual security attacks. We achieve those results by using Networking Chatbot, a deep recurrent neural network: Long Short Term Memory (LSTM) on top of Apache Spark Framework that has an input of flow traffic and traffic aggregation and the output is a language of two words, normal or abnormal. We propose merging the concepts of language processing, contextual analysis, distributed deep learning, big data, anomaly detection of flow analysis. We propose a model that describes the network abstract normal behavior from a sequence of millions of packets within their context and analyzes them in near real-time to detect point, collective and contextual anomalies. Experiments are done on MAWI dataset, and it shows better detection rate not only than signature IDS, but also better than traditional anomaly IDS. The experiment shows lower false positive, higher detection rate and better point anomalies detection. As for prove of contextual and collective anomalies detection, we discuss our claim and the reason behind our hypothesis. But the experiment is done on random small subsets of the dataset because of hardware limitations, so we share experiment and our future vision thoughts as we wish that full prove will be done in future by other interested researchers who have better hardware infrastructure than ours.
翻译:随着信息技术在所有生命领域的日益应用,黑客比以往任何时候都更具负面效果。此外,随着技术的开发,袭击数量每几个月都成倍增长,并且变得更加复杂,传统IDS的探测效率低下。本文件提出一个解决方案,不仅能够发现探测率较高、反向反应低于已经使用的IDS的新的威胁,而且能够发现集体和背景安全攻击。我们通过使用网络查博特(Chatbot)这个深层次的经常性神经网络来取得这些结果:Apache Spark框架之上的长期短期记忆(LSTM),它含有流量流量和流量汇总的投入,而产出是两种语言的语言,正常或异常的语言。我们提议合并语言处理、背景分析、散发的深度学习、大数据、异常检测流动分析等概念的概念。我们提出一个模型,用以描述网络在其背景下的数百万个包序列中的抽象正常行为,并在近实时时间分析它们以探测点、集体和背景异常。在AWA数据库上进行的实验不仅比签名IDS综合,而且比传统的异常数据系统要好。我们建议合并语言处理语言处理,我们未来的图像分析中要显示我们的未来的更深层次分析比率,因为我们的未来的精确测测测测测算比我们更低点,因为我们的未来测算比我们更低。我们的未来测算比我们更精确测算比我们更精确,因为我们的未来测测算比比我们更精确测测算比我们更精确。我们更。