Score matching (SM) is a convenient method for training flexible probabilistic models, and it is often preferred over the traditional maximum-likelihood (ML) approach. However, the resulting models are less interpretable than normalized models; as such, training robustness is in general difficult to assess. We present a critical study of existing variational SM objectives, showing catastrophic failure on a wide range of datasets and network architectures. Our theoretical insights on these objectives emerge directly from their equivalent autoencoding losses when optimizing variational autoencoder (VAE) models. First, we show that in the Fisher autoencoder, SM produces far worse models than maximum likelihood, and approximate inference by Fisher divergence (FD) can lead to low-density local optima. However, with important modifications, this objective reduces to a regularized autoencoding loss that resembles the evidence lower bound (ELBO). This analysis predicts that the modified SM algorithm should behave very similarly to ELBO on Gaussian VAEs. We then review two other FD-based objectives from the literature and show that they reduce to uninterpretable autoencoding losses, likely leading to poor performance. The experiments verify our theoretical predictions and suggest that only ELBO and the baseline objective robustly produce expected results, while previously proposed SM methods do not.
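As a toy illustration of the score-matching objective discussed above (not code from the paper), the sketch below fits a one-dimensional Gaussian by minimizing Hyvärinen's SM objective J = E[½·s(x)² + s′(x)], where s(x) = −(x − μ)/σ² is the model score. For this model the SM minimizer coincides with the ML estimates (sample mean and variance), consistent with the abstract's point that a well-behaved SM objective can track ML/ELBO behavior on Gaussian models. Variable names and the gradient-descent setup are illustrative assumptions.

```python
import numpy as np

# Synthetic data from a Gaussian with unknown (to the model) parameters.
rng = np.random.default_rng(0)
x = rng.normal(loc=2.0, scale=1.5, size=5000)

# Hyvarinen SM objective for p(x) = N(mu, var):
#   s(x)  = -(x - mu) / var        (model score)
#   s'(x) = -1 / var
#   J(mu, var) = E[(x - mu)^2] / (2 var^2) - 1 / var
# Analytic gradients of J, derived for this toy model:
mu, log_var = 0.0, 0.0           # optimize log-variance for positivity
lr = 0.05
for _ in range(5000):
    var = np.exp(log_var)
    g_mu = -np.mean(x - mu) / var**2
    g_var = -np.mean((x - mu) ** 2) / var**3 + 1.0 / var**2
    mu -= lr * g_mu
    log_var -= lr * g_var * var  # chain rule: dJ/dlog_var = var * dJ/dvar

# For a Gaussian model, the SM minimizer equals the ML estimates.
print(mu, np.exp(log_var))
```

Here SM and ML agree because the Gaussian family is well specified; the abstract's failure cases arise for the variational SM objectives applied to flexible VAE models, where the equivalent autoencoding losses become uninterpretable.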