Fraud detection is to identify, monitor, and prevent potentially fraudulent activities from complex data. The recent development and success in AI, especially machine learning, provides a new data-driven way to deal with fraud. From a methodological point of view, machine learning based fraud detection can be divided into two categories, i.e., conventional methods (decision tree, boosting...) and deep learning, both of which have significant limitations in terms of the lack of representation learning ability for the former and interpretability for the latter. Furthermore, due to the rarity of detected fraud cases, the associated data is usually imbalanced, which seriously degrades the performance of classification algorithms. In this paper, we propose deep boosting decision trees (DBDT), a novel approach for fraud detection based on gradient boosting and neural networks. In order to combine the advantages of both conventional methods and deep learning, we first construct soft decision tree (SDT), a decision tree structured model with neural networks as its nodes, and then ensemble SDTs using the idea of gradient boosting. In this way we embed neural networks into gradient boosting to improve its representation learning capability and meanwhile maintain the interpretability. Furthermore, aiming at the rarity of detected fraud cases, in the model training phase we propose a compositional AUC maximization approach to deal with data imbalances at algorithm level. Extensive experiments on several real-life fraud detection datasets show that DBDT can significantly improve the performance and meanwhile maintain good interpretability. Our code is available at https://github.com/freshmanXB/DBDT.
翻译:发现欺诈是为了查明、监测和防止复杂的数据中潜在的欺诈活动。AI最近的发展和成功,特别是机器学习,为处理欺诈提供了一种新的数据驱动方法。从方法观点来看,机器学习的欺诈检测可以分为两类,即常规方法(决定树,促进...)和深层次学习,两者在前者缺乏代表性学习能力以及后者的可解释性方面都有重大限制。此外,由于所发现的欺诈案件很少见,相关数据通常不平衡,严重地降低了分类算法的性能。在本文件中,我们提议深度提升决策树(DBDDDT),这是基于梯度增强和神经网络的新的欺诈检测方法。为了将常规方法的优势和深层次学习结合起来,我们首先建造软决策树(SDDTD),一个决定树结构模型,以神经网络为节点,然后利用梯度增强的理念将SDDTDD数据混合起来,从而大大地降低分类算法的效性能。我们用这种方式将神经网络嵌入加速度提升其代表性学习能力,同时在梯度增强和神经系统分析阶段里,我们用测算数据测测算数据。