We consider extensions of the Shannon relative entropy, referred to as f-divergences. Three classical related computational problems are typically associated with these divergences: (a) estimation from moments, (b) computing normalizing integrals, and (c) variational inference in probabilistic models. These problems are related to one another through convex duality, and for all them, there are many applications throughout data science, and we aim for computationally tractable approximation algorithms that preserve properties of the original problem such as potential convexity or monotonicity. In order to achieve this, we derive a sequence of convex relaxations for computing these divergences from non-centered covariance matrices associated with a given feature vector: starting from the typically non-tractable optimal lower-bound, we consider an additional relaxation based on "sums-of-squares", which is is now computable in polynomial time as a semidefinite program, as well as further computationally more efficient relaxations based on spectral information divergences from quantum information theory. For all of the tasks above, beyond proposing new relaxations, we derive tractable algorithms based on augmented Lagrangians and first-order methods, and we present illustrations on multivariate trigonometric polynomials and functions on the Boolean hypercube.
翻译:我们考虑香农相对酶的延伸,称为 f-diverences 。 三种典型相关的计算问题通常与这些差异相关:(a) 从瞬间估算, (b) 计算成正常整体体, (c) 概率模型中的变异推论。 这些问题通过共性双重性彼此相关, 对它们来说, 在整个数据科学中有许多应用, 我们的目标是可计算可移动的近似算法, 以保存原始问题( 如潜在共性或单调性)的特性。 为了实现这一点, 我们为计算与特定特性矢量相关的非中心共变异矩阵的差异, 得出一系列共解调调调顺序:(a) 从通常不可抽取的最佳较低约束性模型开始, 我们考虑根据“ 共和量” 来进一步放松, 现在在多货币时可以比较为半确定性程序, 以及基于光谱信息与量信息差异和量子信息理论的进一步计算更高效的放松调。 对于所有基于当前顶级的超常量性矩阵和多货币计算方法的任务, 我们提出新的拉尼加性矩阵, 和基于以上三阶的增制。