It is reported that the number of online payment users in China has reached 854 million; with the emergence of community e-commerce platforms, the trend of integration of e-commerce and social applications is increasingly intense. Community e-commerce is not a mature and sound comprehensive e-commerce with fewer categories and low brand value. To effectively retain community users and fully explore customer value has become an important challenge for community e-commerce operators. Given the above problems, this paper uses the data-driven method to study the prediction of community e-commerce customers' repurchase behaviour. The main research contents include 1. Given the complex problem of feature engineering, the classic model RFM in the field of customer relationship management is improved, and an improved model is proposed to describe the characteristics of customer buying behaviour, which includes five indicators. 2. In view of the imbalance of machine learning training samples in SMOTE-ENN, a training sample balance using SMOTE-ENN is proposed. The experimental results show that the machine learning model can be trained more effectively on balanced samples. 3. Aiming at the complexity of the parameter adjustment process, an automatic hyperparameter optimization method based on the TPE method was proposed. Compared with other methods, the model's prediction performance is improved, and the training time is reduced by more than 450%. 4. Aiming at the weak prediction ability of a single model, the soft voting based RF-LightgBM model was proposed. The experimental results show that the RF-LighTGBM model proposed in this paper can effectively predict customer repurchase behaviour, and the F1 value is 0.859, which is better than the single model and previous research results.
翻译:据报道,中国在线支付用户人数已达8.54亿;随着社区电子商务平台的出现,社区电子商务和社会应用一体化的趋势日益强烈;社区电子商务不是一个成熟和健全的全面电子商务,类别较少,品牌价值低;为了有效留住社区用户和充分探讨客户价值,社区电子商务经营者面临重大挑战;鉴于上述问题,本文件使用数据驱动方法研究社区电子商务客户回购行为的预测;主要研究内容包括:1. 鉴于地貌工程的复杂问题,客户关系管理领域的典型RFM模型得到改进,并提议改进模式,说明客户购买行为的特点,其中包括五个指标;2. 鉴于SMOTE-ENN的机器学习培训样本不平衡,提议使用SMOTE-ENN的培训样本平衡; 实验结果显示,机器学习模型可以在平衡的样本上进行更有效的培训;1. 以参数调整过程的复杂程度为目标,以TPE 客户关系管理领域典型RFFM为主的自动超光度优化方法得到改进;4. 将SMOE-E-E-E-E-E-E-L的预测能力与单一的预测结果相比较,以更低的预测结果显示单一的预测结果。