The paper proposes a new hybrid Bayesian network learning algorithm, termed Forward Early Dropping Hill Climbing (FEDHC), designed to work with either continuous or categorical data. FEDHC consists of a skeleton identification phase (learning the conditional associations among the variables) followed by the scoring phase that assigns the causal directions. Specifically for the case of continuous data, a robust to outliers version of FEDHC is also proposed. The paper manifests that the only implementation of MMHC in the statistical software \textit{R}, is prohibitively expensive and a new implementation is offered. The FEDHC is tested via Monte Carlo simulations that distinctly show it is computationally efficient, and produces Bayesian networks of similar to, or of higher accuracy than MMHC and PCHC. FEDHC yields more accurate Bayesian networks than PCHC with continuous data but less accurate with categorical data. Finally, an application of FEDHC, PCHC and MMHC algorithms to real data, from the field of economics, is demonstrated using the statistical software \textit{R}.
翻译:该文件提出了一个新的混合贝叶西亚网络学习算法,称为 " 前早期下降山爬行 " (FEDHC),旨在利用连续或绝对的数据开展工作。FEDHC包括一个骨干识别阶段(学习各种变量之间的有条件关联),然后是分出因果方向的评分阶段。具体地说,就连续数据而言,还提出了对异端版本的FEDHC的有力建议。该文件表明,在统计软件\ textit{R}中,MMMHC的唯一实施费用太高,而且提供了新的实施。FEDHC通过蒙特卡洛模拟测试,明显地表明它具有计算效率,并生成了类似于或比MMHC和PCHC更精确的巴伊西亚网络。FEDHC产生比PCHC更准确的波亚网络,连续数据,但与绝对数据不那么准确。最后,FEDHC、PCHC和MHC算法对来自经济领域的真实数据的应用,使用统计软件\textit{R}得到证明。