Most of the existing clustering methods are based on a single granularity of information, such as the distance and density of each data. This most fine-grained based approach is usually inefficient and susceptible to noise. Therefore, we propose a clustering algorithm that combines multi-granularity Granular-Ball and minimum spanning tree (MST). We construct coarsegrained granular-balls, and then use granular-balls and MST to implement the clustering method based on "large-scale priority", which can greatly avoid the influence of outliers and accelerate the construction process of MST. Experimental results on several data sets demonstrate the power of the algorithm. All codes have been released at https://github.com/xjnine/GBMST.
翻译:现有的大多数聚类方法都是基于单一信息的粒度,例如数据的距离和密度等。这种最细粒度的方法通常效率较低,易受噪音干扰。因此,我们提出了一种聚类算法,该算法结合了多粒度Granul