We develop a new framework for embedding joint probability distributions in tensor product reproducing kernel Hilbert spaces (RKHS). Our framework accommodates a low-dimensional, normalized and positive model of a Radon-Nikodym derivative, which we estimate from sample sizes of up to several million data points, alleviating the inherent limitations of RKHS modeling. Well-defined normalized and positive conditional distributions are natural by-products to our approach. The embedding is fast to compute and accommodates learning problems ranging from prediction to classification. Our theoretical findings are supplemented by favorable numerical results.
翻译:我们开发了一个新的框架,将联合概率分布嵌入到张量积重现核希尔伯特空间(RKHS)中。我们的框架包含了一个低维,归一化和正的Radon-Nikodym导数模型,我们可以从多达数百万个数据点的样本量中估计出来,从而缓解了RKHS建模的固有限制。良好定义的归一化和正条件分布是我们方法的自然副产品。嵌入式计算速度快,适用于从预测到分类的学习问题。我们的理论结果得到了有利的数字结果的支持。