In this article we introduce the notion of a Split Variational Autoencoder (SVAE), whose output $\hat{x}$ is obtained as a weighted sum $\sigma \odot \hat{x}_1 + (1-\sigma) \odot \hat{x}_2$ of two generated images $\hat{x}_1,\hat{x}_2$, where $\sigma$ is a learned compositional map. The network is trained like a standard Variational Autoencoder, with a negative log-likelihood loss between training and reconstructed images. The decomposition is nondeterministic, but follows two main schemes, which we may roughly categorize as either "syntactic" or "semantic". In the first case, the map tends to exploit the strong correlation between adjacent pixels, splitting the image into two complementary high-frequency sub-images. In the second case, the map typically focuses on the contours of objects, splitting the image into interesting variations of its content, with more marked and distinctive features. In this case, the Fr\'echet Inception Distance (FID) of $\hat{x}_1$ and $\hat{x}_2$ is usually lower (hence better) than that of $\hat{x}$, which clearly suffers from being the average of the former two. In a sense, an SVAE forces the Variational Autoencoder to {\em make choices}, in contrast with its intrinsic tendency to average between alternatives in order to minimize the reconstruction loss towards a specific sample. According to the FID metric, our technique, tested on typical datasets such as MNIST, CIFAR-10 and CelebA, allows us to outperform all previous purely variational architectures (those not relying on normalizing flows).
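The compositional output described above can be sketched in a few lines. This is a minimal NumPy illustration of the combination step only, assuming $\hat{x}_1$, $\hat{x}_2$ and $\sigma$ are arrays of the same shape produced by the two decoder branches and the (hypothetical) mask branch; it is not the authors' implementation.

```python
import numpy as np

def svae_output(x1_hat, x2_hat, sigma):
    """Combine two generated images with a compositional map.

    sigma is an elementwise weight in [0, 1] (in the paper, a learned
    map); the output is the convex combination of the two candidates.
    """
    return sigma * x1_hat + (1.0 - sigma) * x2_hat

# Toy usage with random "images"; in the real model these would come
# from the two decoder branches of the SVAE.
rng = np.random.default_rng(0)
x1 = rng.random((28, 28))
x2 = rng.random((28, 28))
sigma = rng.random((28, 28))  # stand-in for the learned mask
x_hat = svae_output(x1, x2, sigma)
```

Note that a "syntactic" split corresponds to a near-binary, high-frequency $\sigma$, while a "semantic" split concentrates the mask along object contours.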