While spectral clustering algorithms for undirected graphs are well established and have been successfully applied to unsupervised machine learning problems ranging from image segmentation and genome sequencing to signal processing and social network analysis, clustering directed graphs remains notoriously difficult. Two of the main challenges are that the eigenvalues and eigenvectors of graph Laplacians associated with directed graphs are in general complex-valued and that there is no universally accepted definition of clusters in directed graphs. We first exploit relationships between the graph Laplacian and transfer operators and in particular between clusters in undirected graphs and metastable sets in stochastic dynamical systems and then use a generalization of the notion of metastability to derive clustering algorithms for directed and time-evolving graphs. The resulting clusters can be interpreted as coherent sets, which play an important role in the analysis of transport and mixing processes in fluid flows.
翻译:虽然非定向图解的光谱集群算法已经确立,并成功地应用于未受监督的机器学习问题,从图像分解和基因组测序到信号处理和社会网络分析,但集群定向图解仍然难以解决,其中两个主要挑战是,与定向图解有关的图解拉平板图的光源值和源源值一般是复杂的,在定向图解中没有普遍接受的集群定义。我们首先利用图解 Laplacian 和传输操作员之间的关系,特别是非定向图解和传输操作员之间的关系,特别是非定向图解和随机动态系统中的元集和元集之间的关系,然后利用对可变性概念的概括化概念来为定向和时间动态图解析集群算法。由此形成的群集可被解释为一致的群集,在分析流中运输和混合过程方面发挥重要作用。