Fraud detection problems are usually formulated as a machine learning problem on a graph. Recently, Graph Neural Networks (GNNs) have shown solid performance on fraud detection. The successes of most previous methods heavily rely on rich node features and high-fidelity labels. However, labeled data is scarce in large-scale industrial problems, especially for fraud detection where new patterns emerge from time to time. Meanwhile, node features are also limited due to privacy and other constraints. In this paper, two improvements are proposed: 1) We design a graph transformation method capturing the structural information to facilitate GNNs on non-attributed fraud graphs. 2) We propose a novel graph pre-training strategy to leverage more unlabeled data via contrastive learning. Experiments on a large-scale industrial dataset demonstrate the effectiveness of the proposed framework for fraud detection.
翻译:最近,图表神经网络(GNNs)在发现欺诈方面表现良好。大多数以往方法的成功在很大程度上依赖于丰富的节点特征和高忠诚标签。然而,在大规模工业问题中,标签数据很少,特别是在新模式不时出现时出现时的欺诈探测方面。同时,节点特征也因隐私和其他制约因素而受到限制。本文件提出了两项改进建议:(1) 我们设计一个图表转换方法,收集结构信息,以便利无归属欺诈图表的GNNs。(2) 我们提出一个新的图表预培训战略,通过对比性学习利用更多的无标签数据。大规模工业数据集实验显示了拟议中的欺诈检测框架的有效性。