Learning the graphical structure of Bayesian networks is key to describing data-generating mechanisms in many complex applications but poses considerable computational challenges. Observational data can only identify the equivalence class of the directed acyclic graph underlying a Bayesian network model, and a variety of methods exist to tackle the problem. Under certain assumptions, the popular PC algorithm can consistently recover the correct equivalence class by reverse-engineering the conditional independence (CI) relationships holding in the variable distribution. The dual PC algorithm is a novel scheme to carry out the CI tests within the PC algorithm by leveraging the inverse relationship between covariance and precision matrices. By exploiting block matrix inversions we can simultaneously perform tests on partial correlations of complementary (or dual) conditioning sets. The multiple CI tests of the dual PC algorithm proceed by first considering marginal and full-order CI relationships and progressively moving to central-order ones. Simulation studies show that the dual PC algorithm outperforms the classic PC algorithm both in terms of run time and in recovering the underlying network structure, even in the presence of deviations from Gaussianity. Additionally, we show that the dual PC algorithm applies for Gaussian copula models, and demonstrate its performance in that setting.
翻译:学习Bayesian网络的图形结构是在许多复杂应用中描述数据生成机制的关键,但具有相当大的计算挑战。观察数据只能确定作为Bayesian网络模型基础的定向单程图的等值等级,并存在着解决这一问题的各种方法。根据某些假设,流行的PC算法可以通过逆向工程,持续恢复在变量分布中保持的有条件独立(CI)关系正确的等值等级。双重PC算法是一种新颖的计划,在PC算法中进行CI测试,利用共变和精确矩阵之间的反向关系。通过利用正反向矩阵,我们可以同时进行对互补(或双向)调节装置部分相关性的组合矩阵测试。双重PC算法的多重CI测试首先考虑边际和全序的CI关系,然后逐步转向中央命令关系。模拟研究表明,双元PC算法在运行时间和修复基础网络结构方面都超越了典型的PC算法,即使存在与高斯立的偏差。此外,我们展示了双重PC算法适用于Gaussian 的运行模式并展示其性。