Graph-based clustering plays an important role in clustering tasks. As graph convolution network (GCN), a variant of neural networks on graph-type data, has achieved impressive performance, it is attractive to find whether GCNs can be used to augment the graph-based clustering methods on non-graph data, i.e., general data. However, given $n$ samples, the graph-based clustering methods usually need at least $O(n^2)$ time to build graphs and the graph convolution requires nearly $O(n^2)$ for a dense graph and $O(|\mathcal{E}|)$ for a sparse one with $|\mathcal{E}|$ edges. In other words, both graph-based clustering and GCNs suffer from severe inefficiency problems. To tackle this problem and further employ GCN to promote the capacity of graph-based clustering, we propose a novel clustering method, AnchorGAE. As the graph structure is not provided in general clustering scenarios, we first show how to convert a non-graph dataset into a graph by introducing the generative graph model, which is used to build GCNs. Anchors are generated from the original data to construct a bipartite graph such that the computational complexity of graph convolution is reduced from $O(n^2)$ and $O(|\mathcal{E}|)$ to $O(n)$. The succeeding steps for clustering can be easily designed as $O(n)$ operations. Interestingly, the anchors naturally lead to a siamese GCN architecture. The bipartite graph constructed by anchors is updated dynamically to exploit the high-level information behind data. Eventually, we theoretically prove that the simple update will lead to degeneration and a specific strategy is accordingly designed.
翻译:基于图形的组群在组群任务中起着重要作用。 图形变形网络( GCN) 是图型数据中神经网络的一种变种, 已经取得了令人印象深刻的性能, 找到GCN是否可以用于增强基于图形的非图形数据组组方法, 即一般数据。 但是, 以图为基础的组群方法通常至少需要 $O (n) 2 美元的时间来构建图表, 图形变形需要近于 $( n) =2) 美元, 用于密度的图形和 $( mathcal{E ⁇ ) 的神经网络网络网络网络网络已经取得了令人印象深刻的绩效。 换句话说, 基于图形的组群集和 GCN 是否能够用来增强基于图形的组群集能力, 我们建议一种新型的组群集方法不是在一般的组合假设中提供, 我们首先显示如何将一个非图表数据集转换成一张图表, $( $( mathcalalalalalalalalalalalalal crow) 运行模式, 将用来构建GOsal- deal deal- seal degradeal deal deal max dal dal dal dal dal 。 。 。 max max max max max lax lax max max max max lax max max max max lax romax romax max max max 。