We formally map the problem of sampling from an unknown distribution with a density in $\mathbb{R}^d$ to the problem of learning and sampling a smoother density in $\mathbb{R}^{Md}$ obtained by convolution with a fixed factorial kernel: the new density is referred to as M-density and the kernel as multimeasurement noise model (MNM). The M-density in $\mathbb{R}^{Md}$ is smoother than the original density in $\mathbb{R}^d$, easier to learn and sample from, yet for large $M$ the two problems are mathematically equivalent since clean data can be estimated exactly given a multimeasurement noisy observation using the Bayes estimator. To formulate the problem, we derive the Bayes estimator for Poisson and Gaussian MNMs in closed form in terms of the unnormalized M-density. This leads to a simple least-squares objective for learning parametric energy and score functions. We present various parametrization schemes of interest including one in which studying Gaussian M-densities directly leads to multidenoising autoencoders--this is the first theoretical connection made between denoising autoencoders and empirical Bayes in the literature. Samples in $\mathbb{R}^d$ are obtained by walk-jump sampling (Saremi & Hyvarinen, 2019) via underdamped Langevin MCMC (walk) to sample from M-density and the multimeasurement Bayes estimation (jump). We study permutation invariant Gaussian M-densities on MNIST, CIFAR-10, and FFHQ-256 datasets, and demonstrate the effectiveness of this framework for realizing fast-mixing stable Markov chains in high dimensions.
翻译:我们正式地将取样问题从一个密度为$mathb{R ⁇ d$的未知分布问题,从一个密度为$mathb{R ⁇ d$,到学习和采样以美元为单位的平滑密度为$mathb{R ⁇ d$,到学习和取样以美元为单位的平滑密度为单位的问题。新密度被称为M-密度和内核为多度测量噪音模型(MNM);M-密度为$mathb{R ⁇ d$,比美元为单位的原始密度为单位的 mathlutb{R ⁇ d$,但对于大M$而言,这两个问题在数学上是相等的,因为清洁数据可以精确估计,因为使用Bayes估测器进行多度测量性观测。为了形成问题,我们用不正规的Moisson和Gaussian MNDMNMs 模式来以封闭形式计算Bayes的测算。这导致一个简单的最低比例目标,用于学习对等量的能量和分分数函数。我们展示了各种平面的平价计划,其中包括直压和直压数据连接。