Variational inference has been widely used in machine learning literature to fit various Bayesian models. In network analysis, this method has been successfully applied to solve the community detection problems. Although these results are promising, their theoretical support is only for relatively dense networks, an assumption that may not hold for real networks. In addition, it has been shown recently that the variational loss surface has many saddle points, which may severely affect its performance, especially when applied to sparse networks. This paper proposes a simple way to improve the variational inference method by hard thresholding the posterior of the community assignment after each iteration. Using a random initialization that correlates with the true community assignment, we show that the proposed method converges and can accurately recover the true community labels, even when the average node degree of the network is bounded. Extensive numerical study further confirms the advantage of the proposed method over the classical variational inference and another state-of-the-art algorithm.
翻译:在机器学习文献中广泛使用变式推论,以适合贝耶斯人的各种模型。在网络分析中,这种方法被成功地应用于解决社区探测问题。虽然这些结果很有希望,但理论支持只针对相对密集的网络,而这种假设可能并不支持真正的网络。此外,最近还显示变式损失表面有许多马鞍点,这可能严重影响其性能,特别是在应用到稀少的网络时。本文件提出了一个简单的方法来改进变式推论方法,在每次迭代后硬阈值社区分配的后方。我们使用与真正的社区任务相关的随机初始化方法,我们表明拟议的方法会汇合并能够准确地恢复真正的社区标签,即使网络的平均节点被捆绑。广泛的数字研究进一步证实了拟议方法相对于传统变式推论和另一个最先进的算法的优势。