Many applications require randomly sampling bipartite graphs with fixed degrees, or randomly sampling incidence matrices with fixed row and column sums. Although several sampling algorithms exist, the curveball algorithm is the most efficient, running in O(n log n) time, and has been proven to sample uniformly at random. In this paper, we introduce the fastball algorithm, which adopts a similar approach but runs in O(n) time. We show that fastball randomly samples large bipartite graphs with fixed degrees more than four times faster than curveball, and illustrate the value of this faster algorithm in the context of the fixed degree sequence model for backbone extraction.
翻译:许多应用程序都要求随机抽样使用固定度的双边图表,或随机抽样使用固定线和列总和的双边图表。虽然存在几种抽样算法,但曲线球算法是最有效率的,在O(nlognn)时间运行,并已被证明可以统一随机抽样。在本文件中,我们引入快速球算法,它采用类似方法,但在O(n)时间运行。我们显示快速球随机抽取大型双边图表,固定度比曲线球快四倍以上,并在骨干提取固定度序列模型中说明这一更快算法的价值。