Modeling distributions of covariates, or density estimation, is a core challenge in unsupervised learning. However, the majority of work only considers the joint distribution, which has limited utility in practical situations. A more general and useful problem is arbitrary conditional density estimation, which aims to model any possible conditional distribution over a set of covariates, reflecting the more realistic setting of inference based on prior knowledge. We propose a novel method, Arbitrary Conditioning with Energy (ACE), that can simultaneously estimate the distribution $p(\mathbf{x}_u \mid \mathbf{x}_o)$ for all possible subsets of unobserved features $\mathbf{x}_u$ and observed features $\mathbf{x}_o$. ACE is designed to avoid unnecessary bias and complexity -- we specify densities with a highly expressive energy function and reduce the problem to only learning one-dimensional conditionals (from which more complex distributions can be recovered during inference). This results in an approach that is both simpler and higher-performing than prior methods. We show that ACE achieves state-of-the-art for arbitrary conditional likelihood estimation and data imputation on standard benchmarks.
翻译:在未经监督的学习中,模拟共变分布或密度估计是一个核心挑战。然而,大多数工作只考虑联合分布,在实际情况下效用有限。一个更普遍和有用的问题是任意的有条件密度估计,目的是在一组共变分布的基础上模拟任何可能的有条件分布,反映基于先前知识的更现实的推论环境。我们提出了一个新颖的方法,即 " 与能源任意配置(ACE) ",该方法可以同时估计所有未观测特征组别($\mathb{x ⁇ u$和观察到的特性$\mathbf{x ⁇ u$)的合销(美元和美元)。 ACE旨在避免不必要的偏差和复杂性,我们指定了高度清晰的能源功能的密度,并将问题降低到只学习一维条件(从中可以恢复更复杂的分布 ) 。 这种方法的结果是,与以往的任意性估算方法相比,表现更简单和更高。