We consider the problem of learning the structure underlying a Gaussian graphical model when the variables (or subsets thereof) are corrupted by independent noise. A recent line of work establishes that even for tree-structured graphical models, only partial structure recovery is possible and goes on to devise algorithms to identify the structure up to an (unavoidable) equivalence class of trees. We extend these results beyond trees and consider the model selection problem under noise for non tree-structured graphs, as tree graphs cannot model several real-world scenarios. Although unidentifiable, we show that, like the tree-structured graphs, the ambiguity is limited to an equivalence class. This limited ambiguity can help provide meaningful clustering information (even with noise), which is helpful in computer and social networks, protein-protein interaction networks, and power networks. Furthermore, we devise an algorithm based on a novel ancestral testing method for recovering the equivalence class. We complement these results with finite sample guarantees for the algorithm in the high-dimensional regime.
翻译:当变量(或其子集)被独立噪音破坏时,我们考虑学习高斯图形模型背后的结构的问题。最近的工作轨迹确定,即使树结构图形模型,也只能回收部分结构,并且继续设计算法,确定结构直至(不可避免的)树等效类。我们将这些结果扩大到树以外,并在非树结构图的噪音下考虑模型选择问题,因为树图无法模拟几种真实世界情景。虽然无法识别,但我们显示,像树结构图一样,模糊性限于等同类。这种有限的模糊性有助于提供有意义的组合信息(即使有噪音),这些信息对计算机和社会网络、蛋白质-蛋白互动网络和电力网络都有帮助。此外,我们根据一种新颖的祖传测试方法设计一种算法,以恢复等同类。我们用高维系统算法的有限样本保证来补充这些结果。