Directed acyclic graph (DAG) models have become widely studied and applied in statistics and machine learning -- indeed, their simplicity facilitates efficient procedures for learning and inference. Unfortunately, these models are not closed under marginalization, making them poorly equipped to handle systems with latent confounding. Acyclic directed mixed graph (ADMG) models characterize margins of DAG models, making them far better suited to handle such systems. However, ADMG models have not seen wide-spread use due to their complexity and a shortage of statistical tools for their analysis. In this paper, we introduce the m-connecting imset which provides an alternative representation for the independence models induced by ADMGs. Furthermore, we define the m-connecting factorization criterion for ADMG models, characterized by a single equation, and prove its equivalence to the global Markov property. The m-connecting imset and factorization criterion provide two new statistical tools for learning and inference with ADMG models. We demonstrate the usefulness of these tools by formulating and evaluating a consistent scoring criterion with a closed form solution.
翻译:在统计和机器学习中广泛研究和应用了定向环形图(DAG)模型 -- -- 事实上,这些模型的简单性为学习和推断的有效程序提供了便利。不幸的是,这些模型在边缘化情况下没有被关闭,使它们无法很好地处理潜伏混乱的系统。环形定向混合图(ADMG)模型是DAG模型的边缘特征,使它们更适合处理这些系统。然而,ADMG模型由于复杂性和缺乏用于分析的统计工具而没有被广泛使用。在本文中,我们引入了M-连锁模型,为由ADMGs生成的独立模型提供了替代的代号。此外,我们界定了以单一方程式为特征的ADMG模型的M-因子化标准,并证明它与全球Markov特性的等值。M-直接图和因子化标准为与ADMG模型的学习和推断提供了两种新的统计工具。我们通过制定和评估一种封闭式解决方案的一致的评分标准来证明这些工具的效用。