Bayesian networks are a widely-used class of probabilistic graphical models capable of representing symmetric conditional independence between variables of interest using the topology of the underlying graph. For categorical variables, they can be seen as a special case of the much more general class of models called staged trees, which can represent any type of non-symmetric conditional independence. Here we formalize the relationship between these two models and introduce a minimal Bayesian network representation of the staged tree, which can be used to read conditional independences in an intutitive way. A new labeled graph termed asymmetry-labeled directed acyclic graph is defined, whose edges are labeled to denote the type of dependence existing between any two random variables. We also present a novel algorithm to learn staged trees which only enforces a specific subset of non-symmetric independences. Various datasets are used to illustrate the methodology, highlighting the need to construct models which more flexibly encode and represent non-symmetric structures.
翻译:Bayesian 网络是一个广泛使用的概率图形模型类别,能够代表使用底图的地形图显示感兴趣的变量之间的对称性有条件独立。 对于绝对变量,它们可以被视为更普通的模型类别的特殊例子,称为阶梯树,它可以代表任何非对称性有条件独立。在这里,我们正式确定这两个模型之间的关系,并采用一个最小的Bayesian 网络代表阶梯树,它可以用来以非对称方式读取有条件独立。定义了一个新的标签式图,称为不对称标签的定向环流图,其边缘被贴上标签,以表示任何两个随机变量之间的依赖性。我们还提出了一种新式算法,以学习阶梯树,只执行非对称性独立的具体组别。使用多种数据集来说明方法,强调建立模型的必要性,这些模型更灵活地显示非对称的编码和代表非对称结构。