Internet traffic classification plays a key role in network visibility, Quality of Services (QoS), intrusion detection, Quality of Experience (QoE) and traffic-trend analyses. In order to improve privacy, integrity, confidentiality, and protocol obfuscation, the current traffic is based on encryption protocols, e.g., SSL/TLS. With the increased use of Machine-Learning (ML) and Deep-Learning (DL) models in the literature, comparison between different models and methods has become cumbersome and difficult due to a lack of a standardized framework. In this paper, we propose an open-source framework, named OSF-EIMTC, which can provide the full pipeline of the learning process. From the well-known datasets to extracting new and well-known features, it provides implementations of well-known ML and DL models (from the traffic classification literature) as well as evaluations. Such a framework can facilitate research in traffic classification domains, so that it will be more repeatable, reproducible, easier to execute, and will allow a more accurate comparison of well-known and novel features and models. As part of our framework evaluation, we demonstrate a variety of cases where the framework can be of use, utilizing multiple datasets, models, and feature sets. We show analyses of publicly available datasets and invite the community to participate in our open challenges using the OSF-EIMTC.
翻译:互联网交通分类在网络可见度、服务质量(QOS)、入侵检测、经验质量(QoE)和交通趋势分析方面发挥着关键作用。为了提高隐私、完整性、保密性和协议模糊性,目前的交通量以加密协议为基础,例如SSL/TLS。随着文献中日益使用机器学习和深学习模式,不同模式和方法之间的比较变得繁琐和困难,因为缺乏标准化框架。在本文件中,我们提议了一个名为OSF-EIMTC的开放源码框架,可以提供学习过程的完整管道。从众所周知的数据集到提取新的和众所周知的特征,目前的通信量以加密协议为基础。随着机械学习和深学习模式(DL)在文献中日益使用,这种框架可以促进交通分类领域的研究,从而更加可重复、可复制、更容易执行和便于更准确地比较OSF-ES-EIMTC的公开源码框架。从众所周知的数据集数据集集到公开数据模型的公开分析,我们使用的多种模式和新模式的演示。