This article presents the data used to evaluate the performance of evolutionary clustering algorithm star (ECA*) compared to five traditional and modern clustering algorithms. Two experimental methods are employed to examine the performance of ECA* against genetic algorithm for clustering++ (GENCLUST++), learning vector quantisation (LVQ) , expectation maximisation (EM) , K-means++ (KM++) and K-means (KM). These algorithms are applied to 32 heterogenous and multi-featured datasets to determine which one performs well on the three tests. For one, ther paper examines the efficiency of ECA* in contradiction of its corresponding algorithms using clustering evaluation measures. These validation criteria are objective function and cluster quality measures. For another, it suggests a performance rating framework to measurethe the performance sensitivity of these algorithms on varos dataset features (cluster dimensionality, number of clusters, cluster overlap, cluster shape and cluster structure). The contributions of these experiments are two-folds: (i) ECA* exceeds its counterpart aloriths in ability to find out the right cluster number; (ii) ECA* is less sensitive towards dataset features compared to its competitive techniques. Nonetheless, the results of the experiments performed demonstrate some limitations in the ECA*: (i) ECA* is not fully applied based on the premise that no prior knowledge exists; (ii) Adapting and utilising ECA* on several real applications has not been achieved yet.
翻译:本文介绍了用来评价进化组合算法星(ECA*)的性能的数据,而与五个传统和现代组合算法相比,这5个传统和现代组合算法是用来评价进化组合算法星(ECA*)的性能的。采用了两种实验方法,对照集群+(GENCLUST+++)、矢量量量计算(LVQ),学习矢量量化(LVQ),期望最大化(EM),K-Ues++(KM++)和K-Um(KM)。这些算法适用于32个异质和多功能数据集,以确定在3个测试中谁表现得好。对于非洲经委会来说,该论文审查了与其使用集群评价措施的相应算法相矛盾的效率。这些验证标准是客观功能和集群质量衡量的计量。另一个实验,它提出了一个业绩评级框架,以衡量这些算法对 varoes数据集特征(集群尺寸、集群数目、集群重叠、集群形状和群集结构结构结构结构)的性贡献有两重:(一)非洲经委会* 发现正确组数组数能力中的对应等的比能力分析; (二) 非洲经委会在前的试验中,没有充分显示前的成绩限制。