Approximate spectral clustering (ASC) was developed to overcome heavy computational demands of spectral clustering (SC). It maintains SC ability in predicting non-convex clusters. Since it involves a preprocessing step, ASC defines new similarity measures to assign weights on graph edges. Connectivity matrix (CONN) is an efficient similarity measure to construct graphs for ASC. It defines the weight between two vertices as the number of points assigned to them during vector quantization training. However, this relationship is undirected, where it is not clear which of the vertices is contributing more to that edge. Also, CONN could be tricked by noisy density between clusters. We defined a directed version of CONN, named DCONN, to get insights on vertices contributions to edges. Also, we provided filtering schemes to ensure CONN edges are highlighting potential clusters. Experiments reveal that the proposed filtering was highly efficient when noise cannot be tolerated by CONN.
翻译:开发近光谱聚集(ASC)是为了克服光谱聚集(SC)的沉重计算需求。 它保持了SC在预测非电解聚集方面的能力。 由于它涉及一个预处理步骤, ASC定义了在图形边缘分配重量的新的相似性措施。 连通性矩阵(CONN)是构建 ASC 图表的一种有效的相似性措施。 它将两个顶点之间的重量定义为矢量量化培训期间分配给它们的点数。 但是,这种关系是非定向的, 不清楚哪个顶点对边缘的贡献更大。 另外, CONN 也可能被各组之间的密度所操纵。 我们定义了一个名为 DCONN 的CONN 定向版本, 以了解对边缘的垂直贡献。 此外, 我们提供了过滤计划,以确保CONN 边缘突出潜在集群。 实验显示,在CONN 无法容忍噪音的情况下,拟议的过滤效率很高。