联合随机逼近自编码器及其在半监督学习中的应用 (Joint-stochastic-approximation Autoencoders with Application to Semi-supervised Learning)

Our examination of existing deep generative models (DGMs), including VAEs and GANs, reveals two problems. First, their capability in handling discrete observations and latent codes is unsatisfactory, though there are interesting efforts. Second, both VAEs and GANs optimize some criteria that are indirectly related to the data likelihood. To address these problems, we formally present Joint-stochastic-approximation (JSA) autoencoders - a new family of algorithms for building deep directed generative models, with application to semi-supervised learning. The JSA learning algorithm directly maximizes the data log-likelihood and simultaneously minimizes the inclusive KL divergence the between the posteriori and the inference model. We provide theoretical results and conduct a series of experiments to show its superiority such as being robust to structure mismatch between encoder and decoder, consistent handling of both discrete and continuous variables. Particularly we empirically show that JSA autoencoders with discrete latent space achieve comparable performance to other state-of-the-art DGMs with continuous latent space in semi-supervised tasks over the widely adopted datasets - MNIST and SVHN. To the best of our knowledge, this is the first demonstration that discrete latent variable models are successfully applied in the challenging semi-supervised tasks.

翻译：我们对现有深度生成模型（包括VAE和GAN）的研究揭示了两个问题。首先，尽管已有一些有趣尝试，但其处理离散观测与隐变量的能力仍不尽如人意。其次，VAE和GAN所优化的目标函数均与数据似然仅有间接关联。为解决这些问题，本文正式提出联合随机逼近自编码器——一种用于构建深度有向生成模型的全新算法体系，并将其应用于半监督学习任务。JSA学习算法直接最大化数据对数似然，同时最小化后验分布与推断模型之间的包容KL散度。我们提供了理论证明，并通过系列实验验证了其优越性：例如对编码器-解码器结构失配具有鲁棒性，能一致处理离散与连续变量。特别地，实验表明在MNIST和SVHN等广泛采用的数据集上，具有离散隐空间的JSA自编码器在半监督任务中达到了与连续隐空间最先进深度生成模型相当的性能。据我们所知，这是首次证明离散隐变量模型能成功应用于具有挑战性的半监督学习任务。