We present a comprehensive study on applying machine learning to detect distributed Denial of service (DDoS) attacks using large-scale Internet of Things (IoT) systems. While prior works and existing DDoS attacks have largely focused on individual nodes transmitting packets at a high volume, we investigate more sophisticated futuristic attacks that use large numbers of IoT devices and camouflage their attack by having each node transmit at a volume typical of benign traffic. We introduce new correlation-aware architectures that take into account the correlation of traffic across IoT nodes, and we also compare the effectiveness of centralized and distributed detection models. We extensively analyze the proposed architectures by evaluating five different neural network models trained on a dataset derived from a 4060-node real-world IoT system. We observe that long short-term memory (LSTM) and a transformer-based model, in conjunction with the architectures that use correlation information of the IoT nodes, provide higher performance (in terms of F1 score and binary accuracy) than the other models and architectures, especially when the attacker camouflages itself by following benign traffic distribution on each transmitting node. For instance, by using the LSTM model, the distributed correlation-aware architecture gives 81% F1 score for the attacker that camouflages their attack with benign traffic as compared to 35% for the architecture that does not use correlation information. We also investigate the performance of heuristics for selecting a subset of nodes to share their data for correlation-aware architectures to meet resource constraints.
翻译:我们提出了关于应用机器学习来检测使用大型互联网(IoT)系统进行分布式拒绝服务袭击(DDoS)的全面研究。虽然先前的工程和现有的DDoS攻击主要侧重于从数量庞大的单个节点传输包件,但我们调查了使用大量IoT装置的更尖端的未来攻击,并通过将每个节点传送到典型的无害交通量来掩盖其攻击。我们引入了考虑到跨IoT节点之间交通量的关联性(DDoS)的新的相关性,我们还比较了集中和分布式检测模型的有效性。我们广泛分析了拟议的结构,我们评估了五个不同的神经网络模型,这些模型是在4060-node真实世界IoT系统的一个数据集上培训的。我们发现,长期记忆(LSTM)和一个基于变异器的模型,这些模型使用与自然节点的相干量信息,提供了比其他模型和结构更高的性能(F1SA的分数和分数的精确性能),特别是当攻击者的模型本身通过遵循友好的交通流流流分配方式进行自我校正结构分析时,我们用81的数学结构来测量了它们的数据。