Markov networks are frequently used in sciences to represent conditional independence relationships underlying observed variables arising from a complex system. It is often of interest to understand how an underlying network differs between two conditions. In this paper, we develop methods for comparing a pair of high-dimensional Markov networks where we allow the number of observed variables to increase with the sample sizes. By taking the density ratio approach, we are able to learn the network difference directly and avoid estimating the individual graphs. Our methods are thus applicable even when the individual networks are dense as long as their difference is sparse. We prove finite-sample Gaussian approximation error bounds for the estimator we construct under significantly weaker assumptions than are typically required for model selection consistency. Furthermore, we propose bootstrap procedures for estimating quantiles of a max-type statistics based on our estimator, and show how they can be used to test the equality of two Markov networks or construct simultaneous confidence intervals. The performance of our methods is demonstrated through extensive simulations. The scientific usefulness is illustrated with an analysis of a new fMRI dataset.
翻译:Markov 网络经常用于科学, 以代表一个复杂的系统所观察到的变量背后的有条件的独立关系。 了解一个基础网络在两种条件下的差异往往令人感兴趣。 在本文中, 我们开发了比较一对高维的Markov 网络的方法, 我们允许观测到的变量数量随样本大小的增加而增加。 通过密度比例法, 我们能够直接了解网络差异, 避免估算单个图表。 因此, 我们的方法可以适用, 即使单个网络只要其差异少, 也非常密集。 我们证明, 我们建造的测算器的定点误差是有限的, 远比模型选择一致性通常所需的假设要弱得多。 此外, 我们提出基于我们测算器估算最大类型统计数据的夸度的靴式程序, 并展示如何使用它们测试两个Markov 网络的等同性或构建同步信任间隔。 我们方法的性能通过广泛的模拟得到证明。 我们的方法的性能通过分析新的FMRI数据集来说明科学效用。