Among many clustering algorithms, the K-means clustering algorithm is widely used because of its simple algorithm and fast convergence. However, this algorithm suffers from incomplete data, where some samples have missed some of their attributes. To solve this problem, we mainly apply MM principles to restore the symmetry of the data, so that K-means could work well. We give the pseudo-code of the algorithm and use the standard datasets for experimental verification. The source code for the experiments is publicly available in the following link: \url{https://github.com/AliBeikmohammadi/MM-Optimization/blob/main/mini-project/MM%20K-means.ipynb}.
翻译:在许多组群算法中,K- means群集算法因其简单的算法和快速趋同而得到广泛使用。 但是,这种算法由于数据不完整而受到影响,有些样本错过了其中的一些属性。为了解决这个问题,我们主要应用MM原则来恢复数据的对称性,以便K-points能够很好地发挥作用。我们给算法提供假码,并使用标准数据集进行实验核查。实验的源代码可以在以下链接中公开查阅:\url{https://github.com/AliBeikmohammadi/MM-Optiminization/b/main/mini-project/MM%20K-moines.ipynb}。