Auto encoding models have been extensively studied in recent years. They provide an efficient framework for sample generation, as well as for analysing feature learning. Furthermore, they are efficient in performing interpolations between data-points in semantically meaningful ways. In this paper, we build further on a previously introduced method for generating canonical, dimension independent, stochastic interpolations. Here, the distribution of interpolation paths is represented as the distribution of a bridge process constructed from an artificial random data generating process in the latent space, having the prior distribution as its invariant distribution. As a result the stochastic interpolation paths tend to reside in regions of the latent space where the prior has high mass. This is a desirable feature since, generally, such areas produce semantically meaningful samples. In this paper, we extend the bridge process method by introducing a discriminator network that accurately identifies areas of high latent representation density. The discriminator network is incorporated as a change of measure of the underlying bridge process and sampling of interpolation paths is implemented using sequential Monte Carlo. The resulting sampling procedure allows for greater variability in interpolation paths and stronger drift towards areas of high data density.
翻译:近年来,对自动编码模型进行了广泛研究,为样本生成和分析特征学习提供了一个有效的框架;此外,这些模型有效地在数据点之间以语义上有意义的方式进行干涉;在本文中,我们进一步以以前采用的方法为基础,生成直截了当的、维度独立的、随机的内插图。这里,内插路径的分布代表着从暗中空间人为随机生成数据的过程中构建的桥梁过程的分布,其先前的分布以其变化性分布为先导分布。因此,迭接的内插图往往位于前高质量的暗层空间区域。这是一个可取的特征,因为一般来说,这类区域产生具有语义意义的样本。在本文中,我们通过引入一个精确识别高潜在代表密度地区的区分器网络来扩展连接过程。歧视网络是作为测量深层桥进程的一种变化而纳入的,而内插图的取样则使用连续的蒙特卡洛进行。由此产生的取样程序使得内插图路径的变异性更大,并更有力地流向高密度区域。