A new hybrid Bayesian network learning algorithm, termed Forward Early Dropping Hill Climbing (FEDHC), devised to work with either continuous or categorical variables. FEDHC consists of a skeleton identification phase and a subsequent scoring phase that assigns the (causal) directions. Further, the paper manifests that the only implementation of MMHC in the statistical software \textit{R}, is prohibitively expensive and a new implementation is offered. In addition, specifically for the case of continuous data, a robust to outliers version of FEDHC, that can be adopted by other BN learning algorithms as well is proposed. The FEDHC is tested via Monte Carlo simulations that distinctly show it is computationally efficient, and produces Bayesian networks of similar to, or of higher accuracy than MMHC and PCHC. Specifically, FEDHC yields more accurate Bayesian networks than PCHC with continuous data but less accurate with categorical data. Finally, an application of FEDHC, PCHC and MMHC algorithms to real data, from the field of economics, is demonstrated using the statistical software \textit{R}.
翻译:此外,该文件还表明,统计软件\ textit{R}中只有执行MMHC才具有令人望而却步的成本,并且提供了一种新的实施方法。此外,具体地说,就连续数据而言,还提议采用FEDHC的强力外推法,其他BN的学习算法也可以采用FEDHC。FEDHC通过Monte Carlo模拟测试,明显显示它具有计算效率,并产生比MMHC和PCHC类似或更精准的Bayesian网络。具体地说,FEDHC的精确巴伊西亚网络比PCHC的连续数据更准确,但与绝对数据不那么精确。最后,使用统计软件演示了FEDHC、PCHC和MMHC的算法对来自经济领域的真实数据的应用。