With the fast growing demand on new services and applications as well as the increasing awareness of data protection, traditional centralized traffic classification approaches are facing unprecedented challenges. This paper introduces a novel framework, Federated Generative Adversarial Networks and Automatic Classification (FGAN-AC), which integrates decentralized data synthesizing with traffic classification. FGAN-AC is able to synthesize and classify multiple types of service data traffic from decentralized local datasets without requiring a large volume of manually labeled dataset or causing any data leakage. Two types of data synthesizing approaches have been proposed and compared: computation-efficient FGAN (FGAN-\uppercase\expandafter{\romannumeral1}) and communication-efficient FGAN (FGAN-\uppercase\expandafter{\romannumeral2}). The former only implements a single CNN model for processing each local dataset and the later only requires coordination of intermediate model training parameters. An automatic data classification and model updating framework has been proposed to automatically identify unknown traffic from the synthesized data samples and create new pseudo-labels for model training. Numerical results show that our proposed framework has the ability to synthesize highly mixed service data traffic and can significantly improve the traffic classification performance compared to existing solutions.
翻译:由于对新服务和应用的需求迅速增长,而且对数据保护的认识不断提高,传统的中央交通分类方法正面临前所未有的挑战。本文件介绍了一个新的框架,即Freeder Generation Aversarial Networks和自动分类(FGAN-AC),将分散的数据与交通分类相结合。FGAN-AC能够对分散的地方数据集的多种类型的服务数据流量进行综合和分类,而不需要大量人工标签数据集或造成任何数据泄漏。提出了两类数据综合化方法并进行了比较:计算效率高的FGAN(FGAN-uppercase\explaceandafter_romannumberal1})和通信效率高的FGAN(FAN-uppercase\exadandafter_romannuall2})。前者只使用单一的CNNET处理每个本地数据集的模型,后来只需要对中间示范培训参数进行协调。一个自动的数据分类和模型更新框架已经提出,以自动识别综合数据样本中的未知流量,并为模式培训创建新的伪标签。Numeralalalalalalalalmaisal 将大大改进了我们现有的服务分类。