This paper introduces a new interpretation of the Variational Autoencoder framework by taking a fully geometric point of view. We argue that vanilla VAE models unveil naturally a Riemannian structure in their latent space and that taking into consideration those geometrical aspects can lead to better interpolations and an improved generation procedure. This new proposed sampling method consists in sampling from the uniform distribution deriving intrinsically from the learned Riemannian latent space and we show that using this scheme can make a vanilla VAE competitive and even better than more advanced versions on several benchmark datasets. Since generative models are known to be sensitive to the number of training samples we also stress the method's robustness in the low data regime.
翻译:本文从完全几何的角度介绍了对变异自动编码框架的新解释。我们认为,香草VAE模型自然暴露出其潜在空间的里曼结构,考虑到这些几何方面因素,可以导致更好的内插和改进生成程序。这一新的拟议抽样方法包括从从学习的里曼潜伏空间内在产生的统一分布中取样,我们表明,使用这一方法可以使香草VAE具有竞争力,甚至比几个基准数据集的先进版本更好。由于已知基因模型对培训样本的数量敏感,我们也强调该方法在低数据系统中的稳健性。