Statistical and computational thresholds for the planted $k$-densest sub-hypergraph problem

Recovering a planted signal perturbed by noise is a fundamental problem in machine learning. In this work, we consider the problem of recovering a planted $k$-densest sub-hypergraph on $h$-uniform hypergraphs over $n$ nodes. This problem appears in several contexts, e.g., community detection, average-case complexity, and neuroscience applications. We first observe that it can be viewed as a structural variant of tensor PCA in which the hypergraph parameters $k$ and $h$ determine the structure of the signal to be recovered when the observations are contaminated by Gaussian noise. We provide tight information-theoretic upper and lower bounds for the recovery problem, as well as the first non-trivial algorithmic bounds based on approximate message passing algorithms. The problem exhibits the information-computation gap typical of analogous settings, and the gap widens as the problem becomes sparser. Interestingly, the bounds show that the structure of the signal does affect the existing bounds for tensor PCA, an effect that the unstructured planted signal does not capture.
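To make the setup concrete, here is a minimal sketch of one way such an instance could be simulated. The abstract does not specify the observation model, so everything below is an assumption made for illustration: each $h$-subset of nodes gets a weight equal to a constant `snr` if it lies entirely inside the planted set, plus independent Gaussian noise. The function names, the `snr` and `noise_std` parameters, and the exhaustive-search estimator are hypothetical. The search is not the approximate message passing algorithm the abstract refers to, and it is only feasible for very small $n$.

```python
import itertools
import random


def sample_planted_hypergraph(n, k, h, snr, noise_std=1.0, seed=0):
    """Sample a noisy h-uniform hypergraph with a planted dense k-subset.

    Assumed model (not taken from the paper): a hyperedge carries signal
    `snr` when all h of its nodes are planted, and every hyperedge gets
    independent Gaussian noise with standard deviation `noise_std`.
    """
    rng = random.Random(seed)
    planted = frozenset(rng.sample(range(n), k))
    weights = {}
    for edge in itertools.combinations(range(n), h):
        signal = snr if planted.issuperset(edge) else 0.0
        weights[edge] = signal + rng.gauss(0.0, noise_std)
    return planted, weights


def densest_subset_exhaustive(weights, n, k, h):
    """Return the k-subset whose internal hyperedges have maximal total weight.

    This enumerates all C(n, k) subsets, so it is exponential in general.
    """
    def total_weight(subset):
        return sum(weights[e] for e in itertools.combinations(subset, h))

    best = max(itertools.combinations(range(n), k), key=total_weight)
    return frozenset(best)


if __name__ == "__main__":
    planted, weights = sample_planted_hypergraph(n=8, k=4, h=3, snr=20.0)
    estimate = densest_subset_exhaustive(weights, n=8, k=4, h=3)
    print("planted:  ", sorted(planted))
    print("recovered:", sorted(estimate))
```

Lowering `snr` toward the noise level makes the exhaustive estimate start to miss planted nodes, which is the regime where the thresholds discussed in the abstract matter.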


Related content

PCA


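In statistics, principal component analysis (PCA) is a method that projects data from a higher-dimensional space into a lower-dimensional space by maximizing the variance along each dimension. Given a collection of points in two, three, or more dimensions, a "best-fit" line can be defined as the line that minimizes the average squared distance from the points to the line. The next best-fit line can be chosen in the same way from the directions perpendicular to the first. Repeating this process yields an orthogonal basis in which the individual dimensions of the data are uncorrelated. These basis vectors are called the principal components.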
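The best-fit-line description can be illustrated with a small sketch for the two-dimensional case. It assumes the line passes through the mean of the points and uses the closed-form angle that maximizes the projected variance. The function names are hypothetical, and a general implementation would instead use an eigendecomposition of the covariance matrix.

```python
import math


def first_principal_axis(points):
    """Unit direction of the best-fit line through the mean of 2-D points."""
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    # Entries of the 2x2 covariance matrix of the centered data.
    sxx = sum((x - mean_x) ** 2 for x, _ in points) / n
    syy = sum((y - mean_y) ** 2 for _, y in points) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points) / n
    # The variance along angle t is
    #   (sxx + syy)/2 + (sxx - syy)/2 * cos(2t) + sxy * sin(2t),
    # which is maximal when 2t = atan2(2*sxy, sxx - syy).
    theta = 0.5 * math.atan2(2.0 * sxy, sxx - syy)
    return (math.cos(theta), math.sin(theta))


def principal_axes(points):
    """Both principal directions; the second is perpendicular to the first."""
    ux, uy = first_principal_axis(points)
    return (ux, uy), (-uy, ux)


if __name__ == "__main__":
    data = [(0.0, 0.0), (1.0, 2.0), (2.0, 4.0), (3.0, 6.0)]
    first, second = principal_axes(data)
    print("first axis: ", first)
    print("second axis:", second)
```

In two dimensions the second axis is fixed once the first is known, so the "repeat in the perpendicular directions" step reduces to a 90-degree rotation.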
