The variational lower bound (a.k.a. ELBO or free energy) is the central objective for many learning algorithms including algorithms for deep unsupervised learning. Learning algorithms change model parameters such that the variational lower bound increases, and until the parameters are close to a stationary point of the learning dynamics. In this purely theoretical contribution, we show that (for a very large class of generative models) the variational lower bound is at all stationary points of learning equal to a sum of entropies. For models with one set of latents and one set observed variables, the sum consists of three entropies: (A) the (average) entropy of the variational distributions, (B) the negative entropy of the model's prior distribution, and (C) the (expected) negative entropy of the observable distributions. The obtained result applies under realistic conditions including: finite numbers of data points, at any stationary points (including saddle points) and for any family of (well behaved) variational distributions. The class of generative models for which we show the equality to entropy sums contains many (and presumably most) standard generative models (including deep models). As concrete examples we discuss probabilistic PCA and Sigmoid Belief Networks. The prerequisites we use to show equality to entropy sums are relatively mild. Concretely, the distributions of a given generative model have to be of the exponential family (with constant base measure), and a model has to satisfy a parameterization criterion (which is usually fulfilled). Proving the equality of the ELBO to entropy sums at stationary points (under the stated conditions) is the main contribution of this work.
翻译:变式较低约束( a.k.a. a. a. ELBO 或免费能源) 是许多学习算法的核心目标, 包括深层不受监督学习的算法。 学习算法会改变模型参数, 使变式较低约束增加, 直至参数接近学习动态的固定点。 在这种纯粹的理论贡献中, 我们显示( 对于一大类基因变异模型来说) 变式较低约束在所有固定学习点的学习点, 等于一个温度值之和。 对于具有一组潜值和一组观察变量的模型, 其总和由三种寄生体组成:( A) 变式分布的( 平均) 模型变式参数, (B) 变式下下调增加, 等值增加, 等值分配的负值, (C) 观察分布结果在现实条件下适用, 包括: 数据点的定数, 任何固定点( 包括垫子点) 和任何( 良好表现的) 变式分布。 变式模型的变式模型的类别, 通常为变式的变式模型, 显示( 我们显示) 变式主要的变式模型为正的基数。