Detecting money laundering in gambling is becoming increasingly challenging for the gambling industry as consumers migrate to online channels. Whilst increasingly stringent regulations have been applied over the years to prevent money laundering in gambling, despite this, online gambling is still a channel for criminals to spend proceeds from crime. Complementing online gambling's growth more concerns are raised to its effects compared with gambling in traditional, physical formats, as it might introduce higher levels of problem gambling or fraudulent behaviour due to its nature of immediate interaction with online gambling experience. However, in most cases the main issue when organisations try to tackle those areas is the absence of high quality data. Since fraud detection related issues face the significant problem of the class imbalance, in this paper we propose a novel system based on Generative Adversarial Networks (GANs) for generating synthetic data in order to train a supervised classifier. Our framework Synthetic Data Generation GAN (SDG-GAN), manages to outperformed density based over-sampling methods and improve the classification performance of benchmarks datasets and the real world gambling fraud dataset.
翻译:在赌博中发现洗钱行为对赌博业越来越具有挑战性,因为消费者移徙到网上渠道。尽管多年来为防止赌博洗钱实施了越来越严格的条例,尽管如此,网上赌博仍然是犯罪分子使用犯罪收益的渠道。 与传统、物理形式的赌博相比,在线赌博的增长对赌博的影响提出了更多的关注,因为赌博可能带来更高的问题赌博或欺诈行为,因为赌博与网上赌博直接互动的性质。然而,在大多数情况下,各组织试图解决这些问题的主要问题是缺乏高质量的数据。 由于欺诈发现相关问题面临阶级失衡的严重问题,我们在本文件中提议建立一个基于“创用自动网”的新系统,用于生成合成数据,以培训监管的分类者。我们的框架合成数据生成 GAN(SDG-GAN)(SDGAN)(SDGAN)(SDGAN)(SD)(SDGAN)(SDGD)(SD)(Suproad), 管理着超越基于过度抽样方法的密度,并改进基准数据集和真实世界赌博欺诈数据集的分类绩效。