在动态图表中采用双向方法快速和准确的异常探测 (Fast and Accurate Anomaly Detection in Dynamic Graphs with a Two-Pronged Approach)

Given a dynamic graph stream, how can we detect the sudden appearance of anomalous patterns, such as link spam, follower boosting, or denial of service attacks? Additionally, can we categorize the types of anomalies that occur in practice, and theoretically analyze the anomalous signs arising from each type? In this work, we propose AnomRank, an online algorithm for anomaly detection in dynamic graphs. AnomRank uses a two-pronged approach defining two novel metrics for anomalousness. Each metric tracks the derivatives of its own version of a 'node score' (or node importance) function. This allows us to detect sudden changes in the importance of any node. We show theoretically and experimentally that the two-pronged approach successfully detects two common types of anomalies: sudden weight changes along an edge, and sudden structural changes to the graph. AnomRank is (a) Fast and Accurate: up to 49.5x faster or 35% more accurate than state-of-the-art methods, (b) Scalable: linear in the number of edges in the input graph, processing millions of edges within 2 seconds on a stock laptop/desktop, and (c) Theoretically Sound: providing theoretical guarantees of the two-pronged approach.

翻译：在动态图表流中, 我们如何检测异常模式的突然外观, 如链接垃圾、跟踪器助推或拒绝服务攻击等? 此外, 我们能否对实际中出现的异常类型进行分类, 并从理论上分析每种类型的异常迹象? 在这项工作中, 我们提出 AnomRank, 用于动态图形中异常检测的在线算法。 AnomRank 使用双管齐下的方法, 定义两种异常现象的新指标。每个指标都跟踪其“ 结点评分( 或节点重要性) ” 功能版本的衍生物。这使我们能够检测任何节点重要性的突然变化。我们从理论上和实验上显示, 双管齐下的方法成功检测出两种常见的异常类型: 边缘的突然重量变化, 以及图形的突然结构变化。 AnomRank 是 (a) 快速和准确度: 最高至49.5x 或比最新设计方法更准确35 % 。 (b) 可升级: 在任何节点的边缘的线性变化中, 双轨方法中提供双向的双向的理论平台/ 。

相关内容

异常检测

关注 102

在数据挖掘中，异常检测（英语：anomaly detection）对不符合预期模式或数据集中其他项目的项目、事件或观测值的识别。通常异常项目会转变成银行欺诈、结构缺陷、医疗问题、文本错误等类型的问题。异常也被称为离群值、新奇、噪声、偏差和例外。特别是在检测滥用与网络入侵时，有趣性对象往往不是罕见对象，但却是超出预料的突发活动。这种模式不遵循通常统计定义中把异常点看作是罕见对象，于是许多异常检测方法（特别是无监督的方法）将对此类数据失效，除非进行了合适的聚集。相反，聚类分析算法可能可以检测出这些模式形成的微聚类。有三大类异常检测方法。[1] 在假设数据集中大多数实例都是正常的前提下，无监督异常检测方法能通过寻找与其他数据最不匹配的实例来检测出未标记测试数据的异常。监督式异常检测方法需要一个已经被标记“正常”与“异常”的数据集，并涉及到训练分类器（与许多其他的统计分类问题的关键区别是异常检测的内在不均衡性）。半监督式异常检测方法根据一个给定的正常训练数据集创建一个表示正常行为的模型，然后检测由学习模型生成的测试实例的可能性。

【深度学习社区检测】Deep Learning for Community Detection: Progress, Challenges and Opportunities

专知会员服务

28+阅读 · 2020年6月13日

商业数据分析，39页ppt

专知会员服务

165+阅读 · 2020年6月2日

【CMU博士论文】使用静态和动态图来异常检测，Mining Anomalies using Static and Dynamic Graphs

专知会员服务

68+阅读 · 2020年5月26日

【清华大学】图随机神经网络，Graph Random Neural Networks

专知会员服务

156+阅读 · 2020年5月26日