Density estimation appears as a subroutine in many learning procedures, so it is of interest to have efficient methods for it to perform in practical situations. Multidimensional density estimation suffers from the curse of dimensionality. A solution to this problem is to add a structural hypothesis through an undirected graphical model on the underlying distribution. We propose ISDE (Independence Structure Density Estimation), an algorithm designed to estimate a density and an undirected graphical model from a particular family of graphs corresponding to Independence Structure (IS), a situation where we can separate features into independent groups. ISDE works for moderately high-dimensional data (up to a few dozen features), and it is useable in parametric and nonparametric situations. Existing methods on nonparametric graphical model estimation focus on multidimensional dependencies only through pairwise ones: ISDE does not suffer from this restriction and can address structures not yet covered by available algorithms. In this paper, we present the existing theory about IS, explain the construction of our algorithm and prove its effectiveness. This is done on synthetic data both quantitatively, through measures of density estimation performance under Kullback-Leibler loss, and qualitatively, in terms of capability to recover IS. By applying ISDE on mass cytometry datasets, we also show how it performs both quantitatively and qualitatively on real-world datasets. Then we provide information about running time.
翻译:在很多学习程序中,密度估计似乎是一种次常规,因此,有必要采用有效方法,使其在实际情况下发挥作用。多元密度估计受到维度的诅咒。这个问题的一个解决办法是,通过一个非定向的分布分布图形模型,增加一个结构假设。我们建议ISDE(独立结构密度估计),一种旨在估计密度的算法,以及一个与独立结构(IS)相对应的特定图表系列的非定向图形模型,一种我们可以将其特性分离成独立群体的情况。ISDE的工作是中度高度数据(最多有几十个特征),在参数和非参数情况下可以使用。关于非对称图形模型估计的现有方法,只通过对基本分布进行非定向的图形模型估计。我们建议ISDE不受到这种限制,可以处理现有算法尚未覆盖的结构。在本文中,我们介绍了关于ISA的现有理论,解释了我们算法的构建,并证明了其有效性。这是在定量数据上,通过在KUULE-DE-Develil数据库下测量密度业绩的尺度,我们用SIS-Qal-deal-dealimal-deal-dealmarial-deal-deal-demodeal dasation dasmation das