Many methods that build powerful variational distributions based on unadjusted Langevin transitions exist. Most of these were developed using a wide range of different approaches and techniques. Unfortunately, the lack of a unified analysis and derivation makes developing new methods and reasoning about existing ones a challenging task. We address this giving a single analysis that unifies and generalizes these existing techniques. The main idea is to augment the target and variational by numerically simulating the underdamped Langevin diffusion process and its time reversal. The benefits of this approach are twofold: it provides a unified formulation for many existing methods, and it simplifies the development of new ones. In fact, using our formulation we propose a new method that combines the strengths of previously existing algorithms; it uses underdamped Langevin transitions and powerful augmentations parameterized by a score network. Our empirical evaluation shows that our proposed method consistently outperforms relevant baselines in a wide range of tasks.
翻译:许多建立在未调整的Langevin转换基础上的强大变分分布的方法存在。其中大多数是使用不同的方法和技术开发的。不幸的是,缺乏统一的分析和推导使得开发新方法和推理现有方法变得困难。我们通过在目标和变分中增加数值模拟欠阻尼Langevin扩散过程及其时间反演来解决这个问题。这种方法的好处有两个:它为许多现有的方法提供了统一的公式,使得开发新方法变得简单。事实上,我们使用我们的公式提出了一种新方法,它结合了以前存在的算法的优点;它使用欠阻尼Langevin转换和由评分网络参数化的强大增强。我们的实证评估表明,我们提出的方法在广泛的任务中始终优于相关基线。