Recent developments in Markov chain Monte Carlo (MCMC) algorithms now allow us to run thousands of chains in parallel almost as quickly as a single chain, using hardware accelerators such as GPUs. We explore the benefits of running many chains, with an emphasis on achieving a target precision in as short a time as possible. One expected advantage is that, while each chain still needs to forget its initial point during a warmup phase, the subsequent sampling phase can be almost arbitrarily short. To determine if the resulting short chains are reliable, we need to assess how close the Markov chains are to convergence to their stationary distribution. The $\hat R$ statistic is a general purpose and popular convergence diagnostic but unfortunately can require a long sampling phase to work well. We present a nested design to overcome this challenge and a generalization called nested $\hat R$. This new diagnostic works under conditions similar to $\hat R$ and completes the MCMC workflow for many GPU-friendly samplers. In addition, the proposed nesting provides theoretical insights into the utility of $\hat R$, in both classical and short-chains regimes.
翻译:Markov连锁公司Monte Carlo(MCMC)的最近发展现在使我们几乎能够利用GPU等硬件加速器,像一个单一链条一样快速平行运行数千个链条。我们探索了运行许多链条的好处,重点是在尽可能短的时间内达到目标精确度。一个预期的优势是,尽管每个链条仍然需要在暖化阶段忘记其初始点,但随后的取样阶段可能几乎是任意的短路。为了确定由此产生的短链条是否可靠,我们需要评估Markov链条与固定分布之间的距离。$R$统计是一个一般目的和大众趋同诊断,但不幸的是,可能需要一个长的取样阶段才能顺利运作。我们提出了一个固定的设计,以克服这一挑战,并有一个叫作“$hat R$”的一般化。这个新的诊断工作在类似$R$的条件下进行,并完成许多对GPU-友好的采样器的MC工作流程。此外,拟议的嵌套提供了对$\hat R$的理论洞察。