Learning a Bayesian network is an NP-hard problem and with an increase in the number of nodes, classical algorithms for learning the structure of Bayesian networks become inefficient. In recent years, some methods and algorithms for learning Bayesian networks with a high number of nodes (more than 50) were developed. But these solutions have their disadvantages, for instance, they only operate one type of data (discrete or continuous) or their algorithm has been created to meet a specific nature of data (medical, social, etc.). The article presents a BigBraveBN algorithm for learning large Bayesian Networks with a high number of nodes (over 100). The algorithm utilizes the Brave coefficient that measures the mutual occurrence of instances in several groups. To form these groups, we use the method of nearest neighbours based on the Mutual information (MI) measure. In the experimental part of the article, we compare the performance of BigBraveBN to other existing solutions on multiple data sets both discrete and continuous. The experimental part also represents tests on real data. The aforementioned experimental results demonstrate the efficiency of the BigBraveBN algorithm in structure learning of Bayesian Networks.
翻译:a Bayesian网络是一个NP-硬性的问题,随着节点数量的增加,学习Bayesian网络结构的古典算法变得效率低下。近年来,开发了一些方法和算法来学习许多节点(50多个)的Bayesian网络。但是,这些解决办法有其缺点,例如,它们只操作一种类型的数据(分解或连续)或其算法,或者它们创建的算法是为了满足数据的具体性质(医疗、社会等)。文章展示了一种大BraveBN算法,用于学习众多节点(100多个)的大型Bayesian网络。该算法利用了衡量若干组间相互发生事件的布拉维系数。为了形成这些组,我们使用基于相互信息(MI)衡量的近邻方法。在文章的实验部分,我们将BigBraveBN的性能与多种离心和连续数据集的其他现有解决办法进行比较。实验部分还展示了对真实数据的测试。上述实验结果表明BraveBenx在Bayases网络结构中学习Bayasian 的Bregal算法的效率。