Describing the relationship between the variables in a study domain and modelling the data generating mechanism is a fundamental problem in many empirical sciences. Probabilistic graphical models are one common approach to tackle the problem. Learning the graphical structure is computationally challenging and a fervent area of current research with a plethora of algorithms being developed. To facilitate the benchmarking of different methods, we present a novel automated workflow, called benchpress for producing scalable, reproducible, and platform-independent benchmarks of structure learning algorithms for probabilistic graphical models. Benchpress is interfaced via a simple JSON-file, which makes it accessible for all users, while the code is designed in a fully modular fashion to enable researchers to contribute additional methodologies. Benchpress currently provides an interface to a large number of state-of-the-art algorithms from libraries such as BiDAG, bnlearn, GOBNILP, pcalg, r.blip, scikit-learn, TETRAD, and trilearn as well as a variety of methods for data generating models and performance evaluation. Alongside user-defined models and randomly generated datasets, the software tool also includes a number of standard datasets and graphical models from the literature, which may be included in a benchmarking workflow. We demonstrate the applicability of this workflow for learning Bayesian networks in four typical data scenarios. The source code and documentation is publicly available from http://github.com/felixleopoldo/benchpress.
翻译:描述研究领域的变量与数据生成机制建模之间的关系是许多实验科学中的一个基本问题。 概率图形模型是解决这一问题的一个共同方法。 学习图形结构具有计算上的挑战性, 也是当前研究的一个热门领域, 正在开发大量的算法。 为了便利对不同方法进行基准化, 我们提出了一个新的自动化工作流程, 称为“ 塔普 ”, 用于为概率化图形模型制作可缩放、 可复制和基于平台的结构学习算法基准。 会场压通过一个简单的 JSON 文件接口, 使所有用户都能使用, 而代码则以完全模块化的方式设计, 使研究人员能够贡献更多的方法。 双面压目前为来自BiDAG、 bnlearn、 GOBNILP、 pcaleg、 r. blipp、 sclip、 scikitrig-learn、 TETubrad和 trilearararrann, 以及数据生成模型和绩效评估的多种方法。 与用户定义模型和随机生成的可应用性数据库/ dismagial smable cumental 一起, 学习了数据库。 在四种数据库中, 数据库中可以将数据库中的数据库中的数据库和数据库中的数据库中的数据库中的数据库中的数据库和数据库中的数据库中的数据库中的数据库和数据库。