Traffic classification associates packet streams with known application labels, which is vital for network security and network management. With the rise of NAT, port dynamics, and encrypted traffic, it is increasingly challenging to obtain unified traffic features for accurate classification. Many state-of-the-art traffic classifiers automatically extract features from the packet stream based on deep learning models such as convolution networks. Unfortunately, the compositional and causal relationships between packets are not well extracted in these deep learning models, which affects both prediction accuracy and generalization on different traffic types. In this paper, we present a chained graph model on the packet stream to keep the chained compositional sequence. Next, we propose CGNN, a graph neural network based traffic classification method, which builds a graph classifier over automatically extracted features over the chained graph. Extensive evaluation over real-world traffic data sets, including normal, encrypted and malicious labels, show that, CGNN improves the prediction accuracy by 23\% to 29\% for application classification, by 2\% to 37\% for malicious traffic classification, and reaches the same accuracy level for encrypted traffic classification. CGNN is quite robust in terms of the recall and precision metrics. We have extensively evaluated the parameter sensitivity of CGNN, which yields optimized parameters that are quite effective for traffic classification.
翻译:在网络安全和网络管理方面,已知应用标签的交通分类关联包流对网络安全和网络管理至关重要。随着NAT、港口动态和加密交通的上升,获取统一交通特征以进行准确分类越来越具有挑战性。许多最先进的交通分类人员根据深层学习模型(如连动网络)自动从数据流中提取特征。不幸的是,在这些深层学习模型中,包的构成和因果关系没有很好地提取,这影响到不同交通种类的预测准确性和一般化。在本文中,我们在数据流上展示了一个链式图表模型,以保持链条的构成序列。接下来,我们提议以图形神经网络为基础的交通分类方法CGNN,在链式图上自动提取的特征上建立一个图形分类器。对真实世界交通数据集,包括正常、加密和恶意标签进行广泛的评价,表明CGNNN的预测准确度从23 ⁇ 提高到29 ⁇,用于应用分类,恶意交通分类的精确度为2 ⁇ 至37 ⁇,并达到加密交通分类的精确度水平。CGNNNNN非常可靠,对运输量的精确度进行了广泛的精确度评估。