Federated Learning (FL) has been recently receiving increasing consideration from the cybersecurity community as a way to collaboratively train deep learning models with distributed profiles of cyberthreats, with no disclosure of training data. Nevertheless, the adoption of FL in cybersecurity is still in its infancy, and a range of practical aspects have not been properly addressed yet. Indeed, the Federated Averaging algorithm at the core of the FL concept requires the availability of test data to control the FL process. Although this might be feasible in some domains, test network traffic of newly discovered attacks cannot be always shared without disclosing sensitive information. In this paper, we address the convergence of the FL process in dynamic cybersecurity scenarios, where the trained model must be frequently updated with new recent attack profiles to empower all members of the federation with latest detection features. To this aim, we propose FLAD (adaptive Federated Learning Approach to DDoS attack detection), a FL solution for cybersecurity applications based on an adaptive mechanism that orchestrates the FL process by dynamically assigning more computation to those members whose attacks profiles are harder to learn, without the need of sharing any test data to monitor the performance of the trained model. Using a recent dataset of DDoS attacks, we demonstrate that FLAD outperforms the original FL algorithm in terms of convergence time and accuracy across a range of unbalanced datasets of heterogeneous DDoS attacks. We also show the robustness of our approach in a realistic scenario, where we retrain the deep learning model multiple times to introduce the profiles of new attacks on a pre-trained model.
翻译:联邦学习联合会(FL)最近越来越多地得到网络安全界的考虑,认为这是合作培训深层次学习模式的方法,其内容包括分布式网络威胁简介,而没有披露培训数据;然而,在网络安全方面采用FL仍然处于初级阶段,一系列实际方面尚未得到妥善解决;事实上,作为FL概念核心的Fred Average算法要求提供测试数据以控制FL进程。虽然在某些领域这可能是可行的,但如果不披露敏感信息,新发现袭击的深度模型网络交通就不能总是共享。在本文件中,我们处理FL进程在动态网络安全情景下的综合问题,在这个动态网络安全情景下,经过培训的模型必须经常更新,以便赋予联邦所有成员以最新检测特征的能力。为此,我们提议FLAD(适应式联邦教育联合会袭击检测的联邦学习方法),一个基于适应机制的FL进程应用FL模型前的FL解决方案,通过动态地将更多的计算方法分配给袭击概况较难了解的成员。我们不需要分享任何测试数据数据,以监测经过培训的FS袭击的准确性模型的运行情况。