In this paper, we propose HyperVision, a realtime unsupervised machine learning (ML) based malicious traffic detection system. Particularly, HyperVision is able to detect unknown patterns of encrypted malicious traffic by utilizing a compact inmemory graph built upon the traffic patterns. The graph captures flow interaction patterns represented by the graph structural features, instead of the features of specific known attacks. We develop an unsupervised graph learning method to detect abnormal interaction patterns by analyzing the connectivity, sparsity, and statistical features of the graph, which allows HyperVision to detect various encrypted attack traffic without requiring any labeled datasets of known attacks. Moreover, we establish an information theory model to demonstrate that the information preserved by the graph approaches the ideal theoretical bound. We show the performance of HyperVision by real-world experiments with 92 datasets including 48 attacks with encrypted malicious traffic. The experimental results illustrate that HyperVision achieves at least 0.92 AUC and 0.86 F1, which significantly outperform the state-of-the-art methods. In particular, more than 50% attacks in our experiments can evade all these methods. Moreover, HyperVision achieves at least 80.6 Gb/s detection throughput with the average detection latency of 0.83s.
翻译:在本文中,我们提出超视像,这是一个不受监管的实时机器学习(ML)的恶意交通探测系统。 特别是,超视能通过使用基于交通模式的紧凑的模拟图,能够探测出未知的加密恶意交通模式。 图形捕捉了以图形结构特征代表的流程互动模式,而不是特定已知袭击的特征。 我们开发了一个不受监管的图形学习方法,通过分析图的连接性、偏移性和统计特征,检测异常互动模式,让超视能在不要求任何已知袭击的标签数据集的情况下检测各种加密袭击流量。 此外,我们建立了一个信息理论模型,以证明图形保存的信息接近理想的理论约束。我们用92个数据集展示了超视像的性能,包括48个加密恶意交通袭击。实验结果表明超视像至少达到0.92奥地利克和0.86F1,这大大超越了最新技术方法。 特别是,我们实验中超过50%的攻击可以回避所有这些方法。 此外, 超视像3在80.6中至少能测到0.8G。