Bayesian网络结构学习中知识和数据之间的信息融合 (Information fusion between knowledge and data in Bayesian network structure learning)

Bayesian Networks (BNs) have become a powerful technology for reasoning under uncertainty, particularly in areas that require causal assumptions that enable us to simulate the effect of intervention. The graphical structure of these models can be determined by causal knowledge, learnt from data, or a combination of both. While it seems plausible that the best approach in constructing a causal graph involves combining knowledge with machine learning, this approach remains underused in practice. We implement and evaluate 10 knowledge approaches with application to different case studies and BN structure learning algorithms available in the open-source Bayesys structure learning system. The approaches enable us to specify pre-existing knowledge that can be obtained from heterogeneous sources, to constrain or guide structure learning. Each approach is assessed in terms of structure learning effectiveness and efficiency, including graphical accuracy, model fitting, complexity, and runtime; making this the first paper that provides a comparative evaluation of a wide range of knowledge approaches for BN structure learning. Because the value of knowledge depends on what data are available, we illustrate the results both with limited and big data. While the overall results show that knowledge becomes less important with big data due to higher learning accuracy rendering knowledge less important, some of the knowledge approaches are actually found to be more important with big data. Amongst the main conclusions is the observation that reduced search space obtained from knowledge does not always imply reduced computational complexity, perhaps because the relationships implied by the data and knowledge are in tension.

翻译：Bayesian Network (BNs) 已成为在不确定性下进行推理的强大技术,特别是在需要因果假设,使我们能够模拟干预效果的领域。这些模型的图形结构可以由因果知识、从数据中学习,或两者的结合来决定。虽然建立因果图的最佳方法似乎似乎包括将知识与机器学习相结合,但这一方法在实践中仍然没有得到充分利用。我们实施和评价了10种知识方法,应用了开放源代码 Bayesys 结构学习系统中的不同案例研究和BN 结构学习算法。这些方法使我们能够具体说明从不同来源获得的已有知识,以限制或指导结构学习。每一种方法都能够通过结构学习效果和效率,包括图形准确性、模型适当性、复杂性和运行时间来加以评估。让第一份文件能够对BN 结构学习的广泛知识方法进行比较评价。由于知识的价值取决于现有的数据,我们用有限和大的数据来说明结果。虽然总体结果显示,知识由于学习的准确性较高而导致知识的准确性降低,从而限制或指导结构学习学习学习。每一种方法在结构上的学习效果上都不太重要,但某些主要观测方法的精髓意味着数据在计算中发现,因为从大数据中发现,因此可能减少主要数据在实际分析中发现,因此比较重要。在空间分析中发现,因此,因此可能减少。在空间研究中较复杂。在空间中较重要。