The graph structure of a Bayesian network (BN) can be learned from data using the well-known score-and-search approach. Previous work has shown that incorporating structured representations of the conditional probability distributions (CPDs) into the score-and-search approach can improve the accuracy of the learned graph. In this paper, we present a novel approach capable of learning the graph of a BN and simultaneously modelling linear and non-linear local probabilistic relationships between variables. We achieve this by a combination of feature selection to reduce the search space for local relationships and extending the score-and-search approach to incorporate modelling the CPDs over variables as Multivariate Adaptive Regression Splines (MARS). MARS are polynomial regression models represented as piecewise spline functions. We show on a set of discrete and continuous benchmark instances that our proposed approach can improve the accuracy of the learned graph while scaling to instances with a large number of variables.
翻译:Bayesian 网络(BN) 的图形结构可以从使用众所周知的得分和搜索方法的数据中学习。先前的工作表明,将有条件概率分布(CPDs)的结构性表示纳入得分和搜索方法可以提高所学图表的准确性。在本文中,我们提出了一个新颖的方法,能够学习BN的图形,同时模拟各变量之间的线性和非线性当地概率关系。我们通过结合地物选择来实现这一点,以减少本地关系的搜索空间,并扩大得分和研究方法,将CPC与变量的模型化作为多变量(MARS)的适应性递减曲线(MARS)。MARS是多数值回归模型,代表成片形样样的样条函数。我们用一组连续的基准实例显示,我们提出的方法可以提高所学图的准确性,同时将大量变量放大为实例。