Sampling edges from a graph in sublinear time is a fundamental problem and a powerful subroutine for designing sublinear-time algorithms. Suppose we have access to the vertices of the graph and know a constant-factor approximation to the number of edges. An algorithm for pointwise $\varepsilon$-approximate edge sampling with complexity $O(n/\sqrt{\varepsilon m})$ has been given by Eden and Rosenbaum [SOSA 2018]. This has been later improved by T\v{e}tek and Thorup [STOC 2022] to $O(n \log(\varepsilon^{-1})/\sqrt{m})$. At the same time, $\Omega(n/\sqrt{m})$ time is necessary. We close the problem, under the assumption of knowing $m$ up to a constant factor, for all but very dense graphs by giving an algorithm with complexity $O(n/\sqrt{m} + \log \varepsilon^{-1})$. Our algorithm is based on a new technique that we call \emph{Bernoulli trial simulation}. We believe this technique could also be useful for other problems. Given access to trials of the form $Bern(p)$, this technique allows us to simulate a Bernoulli trial $Bern(f(p) \pm \varepsilon)$ (without knowing $p$), in time complexity $O(\log \varepsilon^{-1})$ for some functions $f$. We specifically use this for $f(p) = 1/(2p)$ for $p \geq 2/3$. Therefore, we can perform rejection sampling, without the algorithm having to know the desired rejection probability. We conjecture Bernoulli trial simulation for $f(p) = 1/(2p)$ can be done exactly in expected $O(1)$ samples. This would lead to an exact algorithm for sampling an edge with complexity $O(n/\sqrt{m})$, completely resolving the problem of sampling an edge, again assuming rough knowledge of $m$. We consider the problem of removing this assumption to be an interesting open problem.
翻译:在亚线性时间的图形中, 取样边缘是一个根本性的问题, 而设计亚线性运算法的极强的亚律程。 假设我们能够访问图形的顶点, 并且知道对边缘数量的常态- 因素近似接近。 同时, 需要使用 $( varepsilon) 的近似边缘取样算法, 复杂 $O( n/\ sqrt\ varepsilon) 和 Rosenbaum (SOSSA 2018) 。 这一点后来通过 T\v{ e} 和 Shorup [STOC 2022] 和 美元( STOC 2022) 来改进。 使用 美元 美元( 美元= 美元 美元) 的直线性运算法( 美元/ qrqrt{m} 。 我们通过使用 美元( 美元) comp) 的算法, 也可以使用 美元 美元 。