Hamiltonian Monte Carlo (HMC) is a popular Markov Chain Monte Carlo (MCMC) algorithm to sample from an unnormalized probability distribution. A leapfrog integrator is commonly used to implement HMC in practice, but its performance can be sensitive to the choice of mass matrix used therein. We develop a gradient-based algorithm that allows for the adaptation of the mass matrix by encouraging the leapfrog integrator to have high acceptance rates while also exploring all dimensions jointly. In contrast to previous work that adapt the hyperparameters of HMC using some form of expected squared jumping distance, the adaptation strategy suggested here aims to increase sampling efficiency by maximizing an approximation of the proposal entropy. We illustrate that using multiple gradients in the HMC proposal can be beneficial compared to a single gradient-step in Metropolis-adjusted Langevin proposals. Empirical evidence suggests that the adaptation method can outperform different versions of HMC schemes by adjusting the mass matrix to the geometry of the target distribution and by providing some control on the integration time.
翻译:汉密尔顿·蒙特卡洛(HMC)是一种流行的马克夫链子蒙特卡洛(MMC)算法,从未正常的概率分布中取样。在实际中,通常使用跳蛙集成器来实施HMC,但其性能对使用的质量矩阵的选择十分敏感。我们开发了一种梯度算法,通过鼓励跳蛙集成器具有较高的接受率,同时共同探索所有层面,允许对质量矩阵进行调整。与以前使用某种预期的平方跳跃距离来调整HMC超参数的工作不同,此处提出的适应战略的目的是通过尽量近似该提议,提高取样效率。我们指出,与Metropolis经调整的Langevin建议中单一的梯度梯度相比,在HMC建议中使用多个梯度是有好处的。从精神学证据表明,适应方法可以通过调整质量矩阵以适应目标分布的几何测量和对整合时间提供某种控制,从而优于不同版本的HMC计划。