Clustering ensemble, or consensus clustering, has emerged as a powerful tool for improving both the robustness and the stability of results from individual clustering methods. Weighted clustering ensemble arises naturally from clustering ensemble. One of the arguments for weighted clustering ensemble is that elements (clusterings or clusters) in a clustering ensemble are of different quality, or that objects or features are of varying significance. However, it is not possible to directly apply the weighting mechanisms from classification (supervised) domain to clustering (unsupervised) domain, also because clustering is inherently an ill-posed problem. This paper provides an overview of weighted clustering ensemble by discussing different types of weights, major approaches to determining weight values, and applications of weighted clustering ensemble to complex data. The unifying framework presented in this paper will help clustering practitioners select the most appropriate weighting mechanisms for their own problems.
翻译:组合组合或协商一致集群已成为提高单个集群方法结果的稳健性和稳定性的有力工具,加权组合组合组合从组合组合组合共同体中自然产生。加权组合组合组合合体的论据之一是组合组合组合中的要素(集群或集群)质量不同,或物体或特征不同,但不可能直接将从分类(监督)领域到集群(非监督)领域的加权机制应用到集群(非监督)领域,也因为集群本身就是一个问题。本文通过讨论不同种类的权重、确定权重价值的主要方法和加权组合组合对复杂数据的应用,对加权组合组合组合组合组合组合组合组合组合的组合作了概述。本文件提出的统一框架将有助于分类工作者为自身问题选择最适当的加权机制。