使用基于频率的 Strach 数据结构在边缘流中分布异常探测 (Distributed Anomaly Detection in Edge Streams using Frequency based Sketch Datastructures)

Often logs hosted in large data centers represent network traffic data over a long period of time. For instance, such network traffic data logged via a TCP dump packet sniffer (as considered in the 1998 DARPA intrusion attack) included network packets being transmitted between computers. While an online framework is necessary for detecting any anomalous or suspicious network activities like denial of service attacks or unauthorized usage in real time, often such large data centers log data over long periods of time (e.g., TCP dump) and hence an offline framework is much more suitable in such scenarios. Given a network log history of edges from a dynamic graph, how can we assign anomaly scores to individual edges indicating suspicious events with high accuracy using only constant memory and within limited time than state-of-the-art methods? We propose MDistrib and its variants which provides (a) faster detection of anomalous events via distributed processing with GPU support compared to other approaches, (b) better false positive guarantees than state of the art methods considering fixed space and (c) with collision aware based anomaly scoring for better accuracy results than state-of-the-art approaches. We describe experiments confirming that MDistrib is more efficient than prior work.

翻译：大型数据中心的日志往往代表长期的网络流量数据。例如,这种网络流量数据通过TCP垃圾袋嗅探器(如1998年DARPA入侵攻击案所考虑的)登录的网络流量数据包括计算机之间传送的网络包。虽然一个在线框架对于发现任何异常或可疑的网络活动是必要的,如拒绝服务攻击或实时未经授权使用等,但与其它方法相比,这类大型数据中心长时间(如TCP倾弃)的大型日志数据往往更适合这种情况。鉴于动态图显示的边缘的网络记录历史,我们如何将异常分数分配给显示可疑事件的个别边缘,仅使用恒定的内存并在比最新方法有限的时间内显示高度精确的可疑事件?我们建议采用计量吸入器及其变式,以便(a)通过分散的处理和GPU支持更快地探测异常事件,(b)比考虑固定空间时的艺术方法状况更好的假正保证,以及(c)以具有碰撞意识的异常点评分比州-艺术方法更准确的结果更有效。我们提出MDrib的实验比以前更能证实MDrib实验。

相关内容

Networking

关注 22

Networking：IFIP International Conferences on Networking。 Explanation：国际网络会议。 Publisher：IFIP。 SIT： http://dblp.uni-trier.de/db/conf/networking/index.html

Python分布式计算，171页pdf，Distributed Computing with Python

专知会员服务

108+阅读 · 2020年5月3日

【微软-Victor Bahl】边缘计算，49页ppt，Edge Computing for Infrastructure

专知会员服务

56+阅读 · 2020年4月13日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日