We present a sublinear time algorithm that allows one to sample multiple edges from a distribution that is pointwise $\epsilon$-close to the uniform distribution, in an \emph{amortized-efficient} fashion. We consider the adjacency list query model, where access to a graph $G$ is given via degree and neighbor queries. The problem of sampling a single edge in this model has been raised by Eden and Rosenbaum (SOSA 18). Let $n$ and $m$ denote the number of vertices and edges of $G$, respectively. Eden and Rosenbaum provided upper and lower bounds of $\Theta^*(n/\sqrt m)$ for sampling a single edge in general graphs (where $O^*(\cdot)$ suppresses $\poly(1/\epsilon)$ and $\poly(\log n)$ dependencies). We ask whether the query complexity lower bound for sampling a single edge can be circumvented when multiple samples are required. That is, can we get an improved amortized per-sample cost if we allow a preprocessing phase? We answer in the affirmative. We present an algorithm that, if one knows the number of required samples $q$ in advance, has an overall cost that is sublinear in $q$, namely, $O^*(\sqrt q \cdot(n/\sqrt m))$, which is strictly preferable to $O^*(q\cdot (n/\sqrt m))$ cost resulting from $q$ invocations of the algorithm by Eden and Rosenbaum. Subsequent to a preliminary version of this work, T\v{e}tek and Thorup (arXiv, preprint) proved that this bound is essentially optimal.
翻译:我们提出一个子线性时间算法, 允许一个人从分布中抽取 $\ epsilon$- 接近统一分布, 以 emph{ amortized- effective} 的方式。 我们考虑对匹配列表查询模式, 通过度和邻接查询访问图形$G$。 这个模型中取样单一边缘的问题由 Eden 和 Rosenbaum (SOSA 18) 提出。 请用 $ 和 $( $) 表示一个垂直和边缘的数量( 美元) 。 Tden 和 Rosenbaum 提供了 $\ Theta\\ ( n/\ sqrt) 的上下边框 。 用于在一般图形中取样单一边缘( $@\\ (\\\\\\\\\\ ecd) 美元) 访问。 如果需要多个样本, 美元, 我们能否在目前 Aqrqrl 中以美元 美元 的平价 。