Diffusion is a fundamental graph procedure and has been a basic building block in a wide range of theoretical and empirical applications such as graph partitioning and semi-supervised learning on graphs. In this paper, we study computationally efficient diffusion primitives beyond random walk. We design an $\widetilde{O}(m)$-time randomized algorithm for the $\ell_2$-norm flow diffusion problem, a recently proposed diffusion model based on network flow with demonstrated graph clustering related applications both in theory and in practice. Examples include finding locally-biased low conductance cuts. Using a known connection between the optimal dual solution of the flow diffusion problem and the local cut structure, our algorithm gives an alternative approach for finding such cuts in nearly linear time. From a technical point of view, our algorithm contributes a novel way of dealing with inequality constraints in graph optimization problems. It adapts the high-level algorithmic framework of nearly linear time Laplacian system solvers, but requires several new tools: vertex elimination under constraints, a new family of graph ultra-sparsifiers, and accelerated proximal gradient methods with inexact proximal mapping computation.
翻译:扩散是一个基本的图表程序,并且是一系列广泛的理论和实验应用的基本组成部分,例如图形分割和半监督的图表学习。在本文中,我们研究了随机行走以外的计算高效扩散原始物。我们设计了一个$\\ el_ 2$- norm流扩散问题的时间随机算法,这是最近提出的基于网络流的传播模型,在理论和实践上都演示了图形组合的应用。例子包括找到当地偏差的低导力削减。使用流动扩散问题最佳双解法与本地剪切断结构之间的已知联系,我们的算法提供了一种在近线性时间找到这种削减的替代方法。从技术角度看,我们的算法为处理图形优化问题中的不平等限制提供了一种新的方法。它调整了近线性拉氏系统解算法的高水平算法框架,但需要几种新的工具:在限制下消除垂直,一个新的图形超分解器组合,以及加速的直径成形梯度计算方法。