Many well-established anomaly detection methods use the distance of a sample to those in its local neighbourhood: so-called `local outlier methods', such as LOF and DBSCAN. They are popular for their simple principles and strong performance on unstructured, feature-based data that is commonplace in many practical applications. However, they cannot learn to adapt for a particular set of data due to their lack of trainable parameters. In this paper, we begin by unifying local outlier methods by showing that they are particular cases of the more general message passing framework used in graph neural networks. This allows us to introduce learnability into local outlier methods, in the form of a neural network, for greater flexibility and expressivity: specifically, we propose LUNAR, a novel, graph neural network-based anomaly detection method. LUNAR learns to use information from the nearest neighbours of each node in a trainable way to find anomalies. We show that our method performs significantly better than existing local outlier methods, as well as state-of-the-art deep baselines. We also show that the performance of our method is much more robust to different settings of the local neighbourhood size.
翻译:许多公认的异常点探测方法使用与当地居民的距离:所谓的“本地异常方法”,如LOF和DBSCAN。这些方法因其简单的原则和在许多实际应用中常见的无结构、基于地貌的数据的强劲性能而很受欢迎。然而,由于缺乏可训练的参数,它们无法适应特定数据集。在本文中,我们首先通过显示这些方法是图形神经网络中使用的较一般信息传递框架的特殊案例来统一地方异常点方法。这使我们能够以神经网络的形式将学习性引入本地异常点方法,以便具有更大的灵活性和直观性:具体地说,我们建议LUNAR是一种新型、基于图形神经网络的异常检测方法。LUNAR学会以可训练的方式使用来自每个节点附近邻居的信息来发现异常点。我们的方法比现有的本地异常方法以及最先进的基线要好得多。我们还表明,我们的方法的性能对于当地街区的不同环境来说要强得多。