Helmholtz Machines (HMs) are a class of generative models composed of two Sigmoid Belief Networks (SBNs), acting as an encoder and a decoder. These models are commonly trained using a two-step optimization algorithm called Wake-Sleep (WS) and more recently by improved versions, such as the Reweighted Wake-Sleep (RWS) and Bidirectional Helmholtz Machines (BiHM). The locality of the connections in a SBN induces sparsity in the Fisher information matrix associated to the model, in the form of a finely-grained block-diagonal structure. In this paper we exploit this property to efficiently train SBNs and HMs using the natural gradient. We present a novel algorithm called Natural Reweighted Wake-Sleep (NRWS), which corresponds to a geometric adaptation of the Reweighted Wake-Sleep, where, differently from most of the previous work, the natural gradient is computed without the need of introducing any approximation of the structure of the Fisher Information Matrix. The experiments performed on standard datasets from the literature show a consistent improvement of NRWS not only with respect to its non-geometric baseline but also with respect to state-of-the-art training algorithms for HMs. The improvement is quantified both in terms of speed of convergence as well as value of the log-likelihood reached after training.
翻译:Helmholtz Machines(HMS)是一组基因模型,由两个Sigmos Listial Livision 网络(SBNs)组成,作为编码器和解码器。这些模型通常使用称为Wake-Sleep(WS)的两步优化算法来训练,而最近则采用改进版,如Rew-Sleep(RWS)和Bidirectivealal Helmholtz Machine(BiHM)等。SBN连接的位置在与该模型相关的渔业信息矩阵中引起松散,其形式是精细的区际对角结构结构结构结构。在本文中,我们利用这一属性,利用自然梯度来有效地培训SBNBRS和HMs(HMs) 。我们介绍称为自然再加权休醒(RWS) (RIS) (RIS) (RIS) 和 Birective-S-Sleep) 机器(BAR) 的地理调整。与大多数以前的工作不同,在计算自然梯度时,在计算时无需对渔业信息信息矩阵结构进行任何近近似近似的架构结构结构的精确结构。在标准改进过程中进行的实验,但仅在标准上也以其基准的逻辑上进行不比级改进。