We study the long-term behavior of the two-thinning variant of the classical balls-and-bins model. In this model, an overseer is provided with uniform random allocation of $m$ balls into $n$ bins in an on-line fashion. For each ball, the overseer could reject its allocation and place the ball into a new bin drawn independently at random. The purpose of the overseer is to reduce the maximum load of the bins, which is defined as the difference between the maximum number of balls in a single bin and $m/n$, i.e., the average number of balls among all bins. We provide tight estimates for three quantities: the lowest maximum load that could be achieved at time $m$, the lowest maximum load that could be achieved uniformly over the entire time interval $[m]:=\{1, 2, \cdots, m\}$, and the lowest \emph{typical} maximum load that could be achieved over the interval $[m]$, where the typicality means that the maximum load holds for $1-o(1)$ portion of the times in $[m]$. We show that when $m$ and $n$ are sufficiently large, a typical maximum load of $(\log n)^{1/2+o(1)}$ can be achieved with high probability, asymptotically the same as the optimal maximum load that could be achieved at time $m$. However, for any strategy, the maximal load among all times in the interval $[m]$ is $\Omega\big(\frac{\log n}{\log\log n}\big)$ with high probability. A strategy achieving this bound is provided. An explanation for this gap is provided by our optimal strategies as follows. To control the typical load, we restrain the maximum load for some time, during which we accumulate more and more bins with relatively high load. After a while, we have to employ for a short time a different strategy to reduce the number of relatively heavily loaded bins, at the expanse of temporarily inducing high load in a few bins.
翻译:我们研究古典球和宾模式的双倍变体的长期行为。 在这个模型中, 向监督员提供统一随机分配的美元球, 以在线方式将美元球划入$n 的垃圾桶。 对于每个球, 监督员可以拒绝分配, 将球放入一个任意抽取的新的垃圾桶。 监督员的目的是降低垃圾箱的最大负载, 定义为单个垃圾桶的最大球数与 美元/ 美元( 美元/ 美元/ 美元) 之间的差额。 也就是说, 所有垃圾桶的平均球数。 我们为三个数量提供了严格的估计: 在时间( 美元) 上可以达到的最低最大负载, 最低最大负载( 美元/ 2, 美元/ 美元/ 美元), 最低负载的最大负载, 在时间( $/ 美元) 的间隔内, 通常表示我们的最大负载量为1美元, 最大负载的负载是美元, 最高负数( 美元) 最高负数( 美元) 。