Log-linear models are a family of probability distributions which capture a variety of relationships between variables, including context-specific independencies. There are a number of approaches for automatic learning of their independence structures from data, although to date, no efficient method exists for evaluating these approaches directly in terms of the structures of the models. The only known methods evaluate these approaches indirectly through the complete model produced, that includes not only the structure but also the model parameters, introducing potential distortions in the comparison. This work presents such a method, that is, a measure for the direct comparison of the independence structures of log-linear models, inspired by the Hamming distance comparison method used in undirected graphical models. The measure presented can be efficiently computed in terms of the number of variables of the domain, and is proven to be a distance metric.
翻译:逻辑线性模型是概率分布的组合,它反映了各种变量之间的各种关系,包括因地制宜的相互依存性; 有一些方法可以自动从数据中学习其独立结构,尽管到目前为止,还没有从模型的结构中直接评价这些方法的有效方法; 唯一已知的方法通过所制作的完整模型间接评价这些方法,其中不仅包括结构,也包括模型参数,在比较中引入潜在的扭曲; 这项工作提出了这样一种方法,即根据非定向图形模型使用的Hamming距离比较方法,对日志线性模型的独立结构进行直接比较的一种衡量标准; 所提出的衡量标准可以有效地计算出域变量的数量,并被证明是一种远距离衡量标准。