Using big data to analyze consumer behavior can provide effective decision-making tools for preventing customer attrition (churn) in customer relationship management (CRM). Focusing on a CRM dataset with several different categories of factors that impact customer heterogeneity (i.e., usage of self-care service channels, duration of service, and responsiveness to marketing actions), we provide new predictive analytics of customer churn rate based on a machine learning method that enhances the classification of logistic regression by adding a mixed penalty term. The proposed penalized logistic regression can prevent overfitting when dealing with big data and minimize the loss function when balancing the cost from the median (absolute value) and mean (squared value) regularization. We show the analytical properties of the proposed method and its computational advantage in this research. In addition, we investigate the performance of the proposed method with a CRM data set (that has a large number of features) under different settings by efficiently eliminating the disturbance of (1) least important features and (2) sensitivity from the minority (churn) class. Our empirical results confirm the expected performance of the proposed method in full compliance with the common classification criteria (i.e., accuracy, precision, and recall) for evaluating machine learning methods.
翻译:利用大数据分析消费者行为可提供有效的决策工具,防止客户在客户关系管理中出现客户减员(减员)现象(CRM),重点是由影响客户异质性的若干不同因素(即自我护理服务渠道的使用、服务期限和对营销行动的反应)组成的客户减员数据集,我们根据一种机器学习方法对客户的减员率进行新的预测分析,通过添加一个混合惩罚术语,加强后勤退缩的分类;拟议的惩罚性后勤退缩可防止在处理大数据时过度适应,并在从中位(绝对值)和中位(中等值)调整成本时最大限度地减少损失功能。我们展示了拟议方法的分析特性及其在这一研究中的计算优势。此外,我们用一个CRM数据集(具有大量特征)调查了拟议方法在不同环境中的性能,有效消除了(1) 最不重要特征的干扰和(2) 少数(寒级) 的敏感度。我们的经验结果证实了拟议方法在完全符合共同分类标准(i、准确性、回顾和精确性)情况下的预期绩效。