Convergence criteria for sampling random graphs with specified degree sequences

The configuration model is a standard tool for generating random graphs with a specified degree sequence, and is often used as a null model to evaluate how much of an observed network's structure is explained by its degrees alone. Except for networks with both self-loops and multi-edges, we lack a direct sampling algorithm for the configuration model, e.g., for simple graphs. A Markov chain Monte Carlo (MCMC) algorithm, based on a degree-preserving double-edge swap, provides an asymptotic solution to sample from the configuration model without bias. However, accurately detecting convergence of this Markov chain on its stationary distribution remains an unsolved problem. Here, we provide a concrete solution to detect convergence and sample from the configuration model without bias. We first develop an algorithm for estimating a sufficient gap between sampled MCMC states for them to be effectively independent. Applying this algorithm to a corpus of 509 empirical networks, we derive a set of computationally efficient heuristics, based on scaling laws, for choosing this sampling gap automatically. We then construct a convergence detection method that applies a Kolmogorov-Smirnov test to sequences of network assortativity values derived from the Markov chain's sampled states. Comparing this test to three generic Markov chain convergence diagnostics, we find that our method is both more accurate and more efficient at detecting convergence.
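The pipeline described in the abstract can be sketched in a few functions. What follows is only an illustrative sketch, not the authors' implementation: the function names (`double_edge_swap_step`, `sample_assortativity`, `ks_statistic`, `windows_agree`) are hypothetical, the sampling gap is passed in by hand rather than estimated, and the convergence check simply compares two windows of assortativity values using a two-sample Kolmogorov-Smirnov statistic with the standard asymptotic 5% critical value. The abstract does not spell out the exact test procedure, so treat that part as an assumption.

```python
import random
from bisect import bisect_right
from collections import Counter


def degree_map(edges):
    """Return {vertex: degree} for an undirected edge list."""
    deg = Counter()
    for u, v in edges:
        deg[u] += 1
        deg[v] += 1
    return deg


def double_edge_swap_step(edges, edge_set, rng):
    """Propose one degree-preserving double-edge swap on a simple graph.

    edges is a list of (u, v) tuples with u < v; edge_set holds the same
    tuples. A proposal that would create a self-loop or a multi-edge is
    rejected, and the chain stays in its current state for that step.
    Returns True if the swap was applied.
    """
    i, j = rng.sample(range(len(edges)), 2)
    old_a, old_b = edges[i], edges[j]
    u, v = old_a
    x, y = old_b
    if rng.random() < 0.5:  # pick one of the two possible rewirings
        x, y = y, x
    if u == x or v == y:  # would create a self-loop
        return False
    new_a = (min(u, x), max(u, x))
    new_b = (min(v, y), max(v, y))
    if new_a in edge_set or new_b in edge_set:  # would create a multi-edge
        return False
    edge_set.difference_update((old_a, old_b))
    edge_set.update((new_a, new_b))
    edges[i], edges[j] = new_a, new_b
    return True


def degree_assortativity(edges, deg):
    """Pearson correlation of the degrees at the two ends of each edge."""
    xs, ys = [], []
    for u, v in edges:  # count every edge in both orientations
        xs += [deg[u], deg[v]]
        ys += [deg[v], deg[u]]
    mean = sum(xs) / len(xs)
    var = sum((x - mean) ** 2 for x in xs) / len(xs)
    if var == 0:  # undefined for regular graphs; 0.0 is a convention here
        return 0.0
    cov = sum((x - mean) * (y - mean) for x, y in zip(xs, ys)) / len(xs)
    return cov / var


def sample_assortativity(edges, n_samples, gap, rng):
    """Run the chain on a copy of edges, recording r every `gap` steps."""
    edges = list(edges)
    edge_set = set(edges)
    deg = degree_map(edges)  # degrees never change under the swap
    values = []
    for _ in range(n_samples):
        for _ in range(gap):
            double_edge_swap_step(edges, edge_set, rng)
        values.append(degree_assortativity(edges, deg))
    return values


def ks_statistic(a, b):
    """Two-sample KS statistic: the largest gap between empirical CDFs."""
    a, b = sorted(a), sorted(b)
    return max(
        abs(bisect_right(a, x) / len(a) - bisect_right(b, x) / len(b))
        for x in set(a) | set(b)
    )


def windows_agree(first, second, c_alpha=1.36):
    """True if the KS statistic is below the asymptotic 5% critical value."""
    n, m = len(first), len(second)
    return ks_statistic(first, second) <= c_alpha * ((n + m) / (n * m)) ** 0.5
```

One possible use is to call `sample_assortativity` twice with different seeds or on consecutive stretches of the chain, then pass the two lists to `windows_agree`. Rejected proposals are deliberately counted as steps, since skipping them would change the chain's stationary distribution.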

Related content

Markov chain

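A Markov chain, named after Andrey Markov (A. A. Markov, 1856-1922), is a discrete-event stochastic process in mathematics that has the Markov property. In such a process, given the current knowledge or information, the past (the history of states before the present) is irrelevant for predicting the future (the states after the present). At each step of a Markov chain, the system may move from one state to another, or stay in its current state, according to a probability distribution. A change of state is called a transition, and the probabilities associated with the different state changes are called transition probabilities. A random walk is an example of a Markov chain: the state at each step is a vertex of a graph, and each step can move to any adjacent vertex, with every adjacent vertex equally likely regardless of the path taken so far.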
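The random walk described above can be written as a short sketch. The `random_walk` function and the adjacency-list representation are illustrative choices, not something specified on this page; the only property it relies on is that the next state is drawn uniformly from the neighbours of the current state, ignoring the earlier path.

```python
import random


def random_walk(adjacency, start, n_steps, rng):
    """Walk n_steps on a graph, moving to a uniformly chosen neighbour.

    adjacency maps each vertex to a list of its neighbours. The next state
    depends only on the current state, which is the Markov property.
    """
    state = start
    path = [state]
    for _ in range(n_steps):
        state = rng.choice(adjacency[state])
        path.append(state)
    return path


# A 4-cycle: 0 - 1 - 2 - 3 - 0
square = {0: [1, 3], 1: [0, 2], 2: [1, 3], 3: [0, 2]}
walk = random_walk(square, start=0, n_steps=10, rng=random.Random(42))
```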
