In this paper, we describe a method for estimating the joint probability density from data samples by assuming that the underlying distribution can be decomposed as a mixture of product densities with few mixture components. Prior works have used such a decomposition to estimate the joint density from lower-dimensional marginals, which can be estimated more reliably with the same number of samples. We combine two key ideas: dictionaries to represent 1-D densities, and random projections to estimate the joint distribution from 1-D marginals, explored separately in prior work. Our algorithm benefits from improved sample complexity over the previous dictionary-based approach by using 1-D marginals for reconstruction. We evaluate the performance of our method on estimating synthetic probability densities and compare it with the previous dictionary-based approach and Gaussian Mixture Models (GMMs). Our algorithm outperforms these other approaches in all the experimental settings.
翻译:本文提出了一种方法,通过假设潜在分布可以被分解为少量混合成分的乘积密度,从数据样本中估计联合概率密度。以往的研究已经使用这种分解方法从低维边际中估计联合概率密度,使用相同数量的样本即可更可靠地估计。我们结合了两个关键思想:使用字典来表示1-D密度和随机投影从1-D边际估计联合分布。我们的算法通过使用1-D边际进行重构,从而提高了先前基于字典的方法的样本复杂度。我们评估了我们的方法在估计合成概率密度上的性能,并将其与以前的基于字典的方法和高斯混合模型(GMMs)进行比较。在所有实验设置中,我们的算法都表现优于其他方法。