Clustering attempts to partition data instances into several distinctive groups, while the similarities among data belonging to the common partition can be principally reserved. Furthermore, incomplete data frequently occurs in many realworld applications, and brings perverse influence on pattern analysis. As a consequence, the specific solutions to data imputation and handling are developed to conduct the missing values of data, and independent stage of knowledge exploitation is absorbed for information understanding. In this work, a novel approach to clustering of incomplete data, termed leachable component clustering, is proposed. Rather than existing methods, the proposed method handles data imputation with Bayes alignment, and collects the lost patterns in theory. Due to the simple numeric computation of equations, the proposed method can learn optimized partitions while the calculation efficiency is held. Experiments on several artificial incomplete data sets demonstrate that, the proposed method is able to present superior performance compared with other state-of-the-art algorithms.
翻译:将数据实例分成几个不同的组别,而属于共同分区的数据之间的相似之处则可以主要保留。此外,在许多现实世界应用中经常出现不完全的数据,并对模式分析产生反常影响。因此,为数据缺失值制定了数据估算和处理的具体解决办法,并吸收了独立的知识开发阶段,以了解信息。在这项工作中,提出了将不完整数据(称为可浸出成分组别)分组的新办法。拟议方法与现行方法不同,它处理与贝斯对齐的数据估算,并在理论上收集丢失的模式。由于对方程式进行简单的数字计算,拟议方法可以在计算效率的同时学习优化的分区。对几个人工不完整数据集的实验表明,拟议方法能够显示优于其他最先进的算法。