The study of Markov processes and broadcasting on trees has deep connections to a variety of areas including statistical physics, graphical models, phylogenetic reconstruction, Markov Chain Monte Carlo, and community detection in random graphs. Notably, the celebrated Belief Propagation (BP) algorithm achieves Bayes-optimal performance for the reconstruction problem of predicting the value of the Markov process at the root of the tree from its values at the leaves. Recently, the analysis of low-degree polynomials has emerged as a valuable tool for predicting computational-to-statistical gaps. In this work, we investigate the performance of low-degree polynomials for the reconstruction problem on trees. Perhaps surprisingly, we show that there are simple tree models with $N$ leaves and bounded arity where (1) nontrivial reconstruction of the root value is possible with a simple polynomial time algorithm and with robustness to noise, but not with any polynomial of degree $N^{c}$ for $c > 0$ a constant depending only on the arity, and (2) when the tree is unknown and given multiple samples with correlated root assignments, nontrivial reconstruction of the root value is possible with a simple Statistical Query algorithm but not with any polynomial of degree $N^c$. These results clarify some of the limitations of low-degree polynomials vs. polynomial time algorithms for Bayesian estimation problems. They also complement recent work of Moitra, Mossel, and Sandon who studied the circuit complexity of Belief Propagation. As a consequence of our main result, we show that for some $c' > 0$ depending only on the arity, $\exp(N^{c'})$ many samples are needed for RBF kernel regression to obtain nontrivial correlation with the true regression function (BP). We pose related open questions about low-degree polynomials and the Kesten-Stigum threshold.
翻译:Markov 进程和树上广播的研究与一系列领域有着深刻的联系,包括统计物理、图形模型、血压模型重建、Markov 链子蒙特卡洛以及随机图中社区检测。值得注意的是,著名的信仰促进算法(BP)在重建问题上实现了Bayes-最佳性能,从树的树根上预测Markov进程的价值,从树叶上的值到树叶上的值。最近,对低度多元分子学的分析已成为一个宝贵的工具,用来预测货币至统计的深度差距。在这项工作中,我们调查了低度多度多度多级分子对树木重建问题的表现。也许令人惊讶的是,我们发现有简单的树模型,有美元叶叶子和边际的表面值,有简单多级的根值重建结果,有简单且具有多级的多级货币分析结果,这些根值的根值有可能通过简单的多元性算法进行重建,但是对于美元- 低度的直位数的直径直径的直径直径直值分析,我们只能根据直径直的直径直径直的直径直径直的直的直估值估算。 当树木的直数的直数的直值研究结果,而亚的直系的直系的直系的直系的直系的直系的直系的直系的直系的直系的直系的直系的直系的直數数值和直系的直系的直系的直系的直系、直系的直系的直系的直系结果。