Network traffic classification using machine learning techniques has been widely studied. Most existing schemes classify entire traffic flows, but there are major limitations to their practicality. At a network router, the packets need to be processed with minimum delay, so the classifier cannot wait until the end of the flow to make a decision. Furthermore, a complicated machine learning algorithm can be too computationally expensive to implement inside the router. In this paper, we introduce flow-packet hybrid traffic classification (FPHTC), where the router makes a decision per packet based on a routing policy that is designed through transferring the learned knowledge from a flow-based classifier residing outside the router. We analyze the generalization bound of FPHTC and show its advantage over regular packet-based traffic classification. We present experimental results using a real-world traffic dataset to illustrate the classification performance of FPHTC. We show that it is robust toward traffic pattern changes and can be deployed with limited computational resource.
翻译:已经广泛研究了使用机器学习技术进行网络交通分类的多数现有计划对交通流量进行分类,但实际操作性有重大限制。 在网络路由器上,需要尽可能少地拖延处理数据包,这样分类器就不能等到流程结束再作出决定。此外,复杂的机器学习算法在计算上可能太昂贵,无法在路由器内实施。在本文中,我们引入了流动-包装混合交通分类(FPHTC),路由器根据路线政策对每包进行决策,而路线政策的设计是通过向位于路由器外的流基分类器传授知识。我们分析了FPHTC的通用约束,并展示了它相对于常规基于包的交通分类的优势。我们用真实世界交通数据集来展示FPHTC的分类性表现,我们表明它对于交通模式的变化非常活跃,并且可以在有限的计算资源范围内部署。