This work proposes a clusterization algorithm called k-Morphological Sets (k-MS), based on morphological reconstruction and heuristics. k-MS is faster than the CPU-parallel k-Means in worst case scenarios and produces enhanced visualizations of the dataset as well as very distinct clusterizations. It is also faster than similar clusterization methods that are sensitive to density and shapes such as Mitosis and TRICLUST. In addition, k-MS is deterministic and has an intrinsic sense of maximal clusters that can be created for a given input sample and input parameters, differing from k-Means and other clusterization algorithms. In other words, given a constant k, a structuring element and a dataset, k-MS produces k or less clusters without using random/ pseudo-random functions. Finally, the proposed algorithm also provides a straightforward means for removing noise from images or datasets in general.
翻译:这项工作提出了一种基于形态重建和超常学的群集算法(k-MS),称为k-Morphlogic Sets(k-MS)。 k-MS在最坏的情况下比CPU-parllel k-Means(CPU-parllel k-Means)更快,并产生数据集的强化可视化和非常明显的群集化。它也比类似对密度和形状敏感的群集法(如Mitosis和TRICLUST)更快。此外, k-MS(k-MS)是决定性的,具有为特定输入样本和输入参数(与 k-Means 和其他集成算法不同)而创建的最大群集的内在感。换句话说,由于常态 k、 结构元素和数据集, k- MS( MS) 不使用随机/ 假随机功能而产生 K 或更少的群集。最后, 拟议的算法还提供了从一般的图像或数据集中删除噪音的简单手段。