This article sets forth a review of knowledge distillation techniques with a focus on their applicability to retail banking contexts. Predictive machine learning algorithms used in banking environments, especially in risk and control functions, are generally subject to regulatory and technical constraints limiting their complexity. Knowledge distillation gives the opportunity to improve the performances of simple models without burdening their application, using the results of other - generally more complex and better-performing - models. Parsing recent advances in this field, we highlight three main approaches: Soft Targets, Sample Selection and Data Augmentation. We assess the relevance of a subset of such techniques by applying them to open source datasets, before putting them to the test on the use cases of BPCE, a major French institution in the retail banking sector. As such, we demonstrate the potential of knowledge distillation to improve the performance of these models without altering their form and simplicity.
翻译:本篇文章回顾了知识蒸馏技术,重点是其适用于零售银行业的情况。在银行业环境中,特别是在风险和控制功能中,使用的预测机器学习算法一般受到监管和技术限制,限制其复杂性。知识蒸馏利用其他一般更为复杂和业绩较好的模型的结果,有机会改进简单模型的性能,而不会使其应用负担过重。我们分析该领域的最近进展,强调三个主要方法:软目标、抽样选择和数据增强。我们评估了这些技术的一组相关性,将这些技术应用于开放源数据集,然后将其用于测试零售银行业部门法国一家主要机构BPCE的使用案例。因此,我们展示了知识蒸馏的潜力,以改进这些模型的性能,而不会改变其形式和简单性。