Money laundering is a profound global problem. Nonetheless, there is little scientific literature on statistical and machine learning methods for anti-money laundering. In this paper, we focus on anti-money laundering in banks and provide an introduction and review of the literature. We propose a unifying terminology with two central elements: (i) client risk profiling and (ii) suspicious behavior flagging. We find that client risk profiling is characterized by diagnostics, i.e., efforts to find and explain risk factors. On the other hand, suspicious behavior flagging is characterized by non-disclosed features and hand-crafted risk indices. Finally, we discuss directions for future research. One major challenge is the need for more public data sets. This may potentially be addressed by synthetic data generation. Other possible research directions include semi-supervised and deep learning, interpretability, and fairness of the results.
翻译:洗钱是一项严重的全球性问题。尽管如此,在反洗钱方面的统计和机器学习方法方面的科学文献很少。在本文中,我们专注于银行反洗钱,并提供一个介绍和文献综述。我们提出了一个统一的术语,包括两个中心元素:(i)客户风险分析和(ii)可疑行为标记。我们发现,客户风险分析特点在于诊断,即寻找和解释风险因素。另一方面,可疑行为标记的特点是非公开特征和手工风险指数。最后,我们讨论了未来研究的方向。其中一个主要挑战是需要更多的公共数据集。这可以通过合成数据生成来解决。其他可能的研究方向包括半监督和深度学习,结果的可解释性和公正性。