Clustering is an unsupervised data mining technique that can be employed to segment customers. The efficient clustering of customers enables banks to design and make offers based on the features of the target customers. The present study uses a real-world financial dataset (Berka, 2000) to cluster bank customers by an encoder-decoder network and the dynamic time warping (DTW) method. The customer features required for clustering are obtained in four ways: Dynamic Time Warping (DTW), Recency Frequency and Monetary (RFM), LSTM encoder-decoder network, and our proposed hybrid method. Once the LSTM model was trained by customer transaction data, a feature vector of each customer was automatically extracted by the encoder.Moreover, the distance between pairs of sequences of transaction amounts was obtained using DTW. Another vector feature was calculated for customers by RFM scoring. In the hybrid method, the feature vectors are combined from the encoder-decoder output, the DTW distance, and the demographic data (e.g., age and gender). Finally, feature vectors were introduced as input to the k-means clustering algorithm, and we compared clustering results with Silhouette and Davies-Bouldin index. As a result, the clusters obtained from the hybrid approach are more accurate and meaningful than those derived from individual clustering techniques. In addition, the type of neural network layers had a substantial effect on the clusters, and high network error does not necessarily worsen clustering performance.
翻译:集群是一种非监督的数据挖掘技术,可以用于部分客户; 高效率的客户集群使银行能够根据目标客户的特征设计和提出报价; 本研究使用一个编码器解码器网络和动态时间扭曲方法向分组银行客户提供真实世界的金融数据集(Berka, 2000年); 集群所需的客户特征以四种方式获得: 动态时间转换(DTW)、 耐变频率和货币(RFM)、 LSTM 编码交换器网络和我们提议的混合方法。 一旦LSTM模型经过客户交易数据分类的培训,每个客户的特性矢量就由编码器自动提取。 Moreover,使用DTW获得交易量序列的对对配距离。 另一种矢量特性是用RFM的评分为客户计算的。 在混合方法中,特性矢量是来自编码器-脱码输出、 DTW的距离和人口数据(例如,年龄和性别)。 最后,每个客户集群的特性矢量矢量矢量矢量矢量由Sildal-commal 生成的结果,而不是从我们从数据库和数字分组中获取。