Anomalies represent rare observations (e.g., data records or events) that deviate significantly from others. Over several decades, research on anomaly mining has received increasing interests due to the implications of these occurrences in a wide range of disciplines. Anomaly detection, which aims to identify rare observations, is among the most vital tasks in the world, and has shown its power in preventing detrimental events, such as financial fraud, network intrusion, and social spam. The detection task is typically solved by identifying outlying data points in the feature space and inherently overlooks the relational information in real-world data. Graphs have been prevalently used to represent the structural information, which raises the graph anomaly detection problem - identifying anomalous graph objects (i.e., nodes, edges and sub-graphs) in a single graph, or anomalous graphs in a database/set of graphs. However, conventional anomaly detection techniques cannot tackle this problem well because of the complexity of graph data. For the advent of deep learning, graph anomaly detection with deep learning has received a growing attention recently. In this survey, we aim to provide a systematic and comprehensive review of the contemporary deep learning techniques for graph anomaly detection. We compile open-sourced implementations, public datasets, and commonly-used evaluation metrics to provide affluent resources for future studies. More importantly, we highlight twelve extensive future research directions according to our survey results covering unsolved and emerging research problems and real-world applications. With this survey, our goal is to create a "one-stop-shop" that provides a unified understanding of the problem categories and existing approaches, publicly available hands-on resources, and high-impact open challenges for graph anomaly detection using deep learning.
翻译:异常现象是罕见的观测(例如,数据记录或事件),这些观测与其他观测有明显的不同。数十年来,异常采矿研究由于在广泛的学科中的影响而越来越引起人们的兴趣。异常采矿研究由于这些异常采矿活动的影响而越来越引起人们的兴趣。异常探测的目的是确定稀有的观测,是全世界最重要的任务之一,显示了它在防止金融欺诈、网络入侵、社会垃圾等有害事件方面的力量。发现任务通常通过查明特征空间中的外围数据点来解决,并固有地忽略了真实世界数据中的关联性信息。图表被普遍用来代表结构性信息,这增加了图表异常应用类别的检测问题:在单一的图表中确定异常图对象(例如,节点、边缘和子图层),或数据库/图集中的异常图表。但是,常规异常探测技术无法很好地解决这一问题,因为图形数据的复杂性。为了进行深入的学习,以图表的形式探测现有异常现象,最近人们越来越关注这些现象。在本次调查中,我们的目标是提供当前异常现象应用的系统、全面的模型研究,我们的目的是为了进行更深入的统计研究,我们用来进行更深入的研究。