Recent works leveraging learning to enhance sampling have shown promising results, in particular by designing effective non-local moves and global proposals. However, learning accuracy is inevitably limited in regions where little data is available such as in the tails of distributions as well as in high-dimensional problems. In the present paper we study an Explore-Exploit Markov chain Monte Carlo strategy ($Ex^2MCMC$) that combines local and global samplers showing that it enjoys the advantages of both approaches. We prove $V$-uniform geometric ergodicity of $Ex^2MCMC$ without requiring a uniform adaptation of the global sampler to the target distribution. We also compute explicit bounds on the mixing rate of the Explore-Exploit strategy under realistic conditions. Moreover, we also analyze an adaptive version of the strategy ($FlEx^2MCMC$) where a normalizing flow is trained while sampling to serve as a proposal for global moves. We illustrate the efficiency of $Ex^2MCMC$ and its adaptive version on classical sampling benchmarks as well as in sampling high-dimensional distributions defined by Generative Adversarial Networks seen as Energy Based Models. We provide the code to reproduce the experiments at the link: https://github.com/svsamsonov/ex2mcmc_new.
翻译:最近的利用学习知识加强取样工作已显示出令人乐观的结果,特别是通过设计有效的非本地移动和全球建议,但学习的准确性不可避免地在缺乏数据的区域受到限制,如分布的尾部和高层面问题,我们在本文件中研究探索-开发Markov连锁的蒙特卡洛战略(Ex=2MCMC$),该战略将当地和全球采样者结合起来,表明它享有这两种方法的优势。我们证明,在不要求全球采样员统一适应目标分布的情况下,美元为Ex%2MC$的统一几何性。我们还在现实条件下对探索-开发战略的混合率作出明确限制。此外,我们还分析了战略的适应性版本(FlEx=2MC$),在进行正常流动的同时,进行取样以作为全球移动的建议。我们举例说明了Ex=2MC$的效率及其在典型采样基准方面的适应性版本,以及在对Geneari Aversarith网络界定的高维度分布进行取样。我们还提供了Genementary Adversaribeth Networks 的链接:Megrodals am/Basedalbus。