Customer churn prediction is a challenging domain of research that contributes to customer retention strategy. The predictive performance of existing machine learning models, which are often adopted by churn communities, appear to be at a bottleneck, partly due to models' poor feature extraction capability. Therefore, a novel algorithm, a hybrid neural network with self-attention enhancement (HNNSAE), is proposed in this paper to improve the efficiency of feature screening and feature extraction, consequently improving the model's predictive performance. This model consists of three main blocks. The first block is the entity embedding layer, which is employed to process the categorical variables transformed into 0-1 code. The second block is the feature extractor, which extracts the significant features through the multi-head self-attention mechanism. In addition, to improve the feature extraction effect, we stack the residual connection neural network on multi-head self-attention modules. The third block is a classifier, which is a three-layer multilayer perceptron. This work conducts experiments on publicly available dataset related to commercial bank customers. The result demonstrates that HNNSAE significantly outperforms the other Individual Machine Learning (IML), Ensemble Machine Learning (EML), and Deep Learning (DL) methods tested in this paper. Furthermore, we compare the performance of the feature extractor proposed in this paper with that of other three feature extractors and find that the method proposed in this paper significantly outperforms other methods. In addition, four hypotheses about model prediction performance and overfitting risk are tested on the publicly available dataset.
翻译:客户热量预测是一个具有挑战性的研究领域,有助于客户保留战略。现有机器学习模型通常被胡椒社区采用,其预测性表现似乎处于瓶颈状态,部分原因是模型的特征提取能力差。因此,本文件提议了一个新型算法,即具有自我注意增强作用的混合神经网络(HNNSAE),目的是提高特征筛选和特征提取的效率,从而改进模型的预测性能。这个模型由三个主要部分组成。第一个单元是实体嵌入层,用于处理转换成0-1代码的绝对变量。第二个单元是功能提取器,通过多头自我注意机制提取重要特征。此外,为了改进特征提取效应,我们把残余连接神经网络堆放在多头自我注意模块上,从而改进模型的预测性能。第三块是一个分类器,这是一个三层模型的多层透视谱。这项工作是针对商业银行客户公开提供的数据集的实验。结果显示, HNNSAE, 用于处理成 0-1 代码的绝对变量。第二个单元是功能提取器,通过多头的自我观察机制的自我测算方法, 本次测试了其他磁质测试的模型的模型, 测试了其他的模型的测试方法。