The random order streaming model has been very fruitful for graph streams, allowing for polylogarithmic or even constant space estimators for fundamental graph problems such as matching size estimation, counting the number of connected components and more. However, the assumption that there are no correlations between the order of arrival of edges in the stream is quite strong. In this paper we introduce (hidden) batch random order streams, where edges are grouped in "batches" (which are unknown to the algorithm) that arrive in a random order, as a natural framework for modelling hidden correlations in the arrival order of edges, and present algorithms and lower bounds for this model. On the algorithmic side, we show how known techniques for connected component counting in constant space due to Peng and Sohler [SODA `18] easily translate from random order streams to our model with only a small loss in parameters. Our algorithm obtains an additive $\varepsilon n$ approximation to the number of connected components in the input graph using space $(1/\varepsilon)^{O(1/\varepsilon)}$ by building a representative sample of vertices in the graph that belong to $O(1/\varepsilon)$-size components to estimate the count. On the lower bound side, we show that $(1/\varepsilon)^{\Omega(1/\varepsilon)}$ space is necessary for finding a connected component of size $O(1/\varepsilon)$ even in graphs where most vertices reside in such components -- this makes progress towards an open problem of Peng and Sohler [SODA `18] and constitutes our main technical contribution. The lower bound uses Fourier analytic techniques inspired by the Boolean Hidden Matching problem. Our main innovation here is the first framework for applying such a Fourier analytic approach to a communication game with a polynomial number of players.
翻译:随机序列流模式对于图形流来说非常富有成果, 允许对基本图形问题, 如匹配大小估计, 计数连接组件的数量等等, 随机序列流模型对于图形流非常有成果, 允许对基本图形问题, 比如匹配大小估计, 计数连接组件的数量。 但是, 假设流中边缘到达的顺序之间没有关联性相当强 。 在本文中, 我们引入( 隐藏) 批次随机序列流, 边际分组为随机顺序( 算法所未知的), 以随机顺序的形式到达, 以自然框架的形式建制图中以空间 $( 1/ vareplon) 的隐藏关联关系, 以及当前算法和下界限。 在运算法方面, 以 美元/\\ valeval% 来构建一个具有代表性的游戏流流流流流 。 我们的电流流流流到模型中, 以 $loral==lus mainal 这样的输入组件, 将一个具有一定的 Oreval mo modeal deval 。