Gaining the trust of customers and providing them empathy are very critical in the financial domain. Frequent occurrence of fraudulent activities affects these two factors. Hence, financial organizations and banks must take utmost care to mitigate them. Among them, ATM fraudulent transaction is a common problem faced by banks. There following are the critical challenges involved in fraud datasets: the dataset is highly imbalanced, the fraud pattern is changing, etc. Owing to the rarity of fraudulent activities, Fraud detection can be formulated as either a binary classification problem or One class classification (OCC). In this study, we handled these techniques on an ATM transactions dataset collected from India. In binary classification, we investigated the effectiveness of various over-sampling techniques, such as the Synthetic Minority Oversampling Technique (SMOTE) and its variants, Generative Adversarial Networks (GAN), to achieve oversampling. Further, we employed various machine learning techniques viz., Naive Bayes (NB), Logistic Regression (LR), Support Vector Machine (SVM), Decision Tree (DT), Random Forest (RF), Gradient Boosting Tree (GBT), Multi-layer perceptron (MLP). GBT outperformed the rest of the models by achieving 0.963 AUC, and DT stands second with 0.958 AUC. DT is the winner if the complexity and interpretability aspects are considered. Among all the oversampling approaches, SMOTE and its variants were observed to perform better. In OCC, IForest attained 0.959 CR, and OCSVM secured second place with 0.947 CR. Further, we incorporated explainable artificial intelligence (XAI) and causal inference (CI) in the fraud detection framework and studied it through various analyses.
翻译:在金融领域,获得客户的信任和同情是十分关键的。欺诈活动的频繁发生影响到这两个因素。因此,金融组织和银行必须极其小心谨慎地减轻这两个因素。其中,自动取款机欺诈交易是银行面临的一个共同问题。欺诈数据集涉及以下关键挑战:数据集高度失衡,欺诈模式正在发生变化等。由于欺诈活动的罕见性,欺诈检测可以作为一种二进制分类问题或一个类别分类(OCC)来进行。在本研究中,我们在从印度收集的自动取款机交易数据集中处理了这些技术。在二进制分类中,我们调查了各种过度采样技术的有效性,如合成少数人过度采样技术(SMOTE)及其变体(Generalation Aversversarial 网络(GAN))及其变异体(Nive Bay Bayes (Nive Bayes(NB), 物流回流(LRR)、支持VCS(SMT)、决定树(DT)、 Rirmal-T(RB) 、Gread-BAR-DT(B)和Serview Streal-Seral、AVDA-Seral-Devely、A-S、Rest、Sy-Sy-L、Syal-Sy、Syal-Slation-L、Sy-I-Sy-Sl)和Sy-Sy-Sreal-SD、SD、A-I-Sl-SB、S、SD、Sl-Sl-SL、SDT、SDT、Risal-I-Sl-I-I-S、S、S、R-S、B、R-I-S、R-I-S-S-S、R-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S