Causal structure learning refers to a process of identifying causal structures from observational data, and it can have multiple applications in biomedicine and health care. This paper provides a practical review and tutorial on scalable causal structure learning models with examples of real-world data to help health care audiences understand and apply them. We reviewed traditional (combinatorial and score-based methods) for causal structure discovery and machine learning-based schemes. We also highlighted recent developments in biomedicine where causal structure learning can be applied to discover structures such as gene networks, brain connectivity networks, and those in cancer epidemiology. We also compared the performance of traditional and machine learning-based algorithms for causal discovery over some benchmark data sets. Machine learning-based approaches, including deep learning, have many advantages over traditional approaches, such as scalability, including a greater number of variables, and potentially being applied in a wide range of biomedical applications, such as genetics, if sufficient data are available. Furthermore, these models are more flexible than traditional models and are poised to positively affect many applications in the future.
翻译:因果关系结构学习是指从观察数据中找出因果结构的过程,它可以在生物医学和保健方面有多种应用。本文件对可扩展因果结构学习模型进行实际审查和辅导,并举例说明真实世界数据,以帮助保健对象理解和应用这些数据。我们审查了因果结构发现和机器学习计划的传统(基于cominal和分数的方法)方法。我们还着重介绍了生物医学方面的最新发展,其中因果结构学习可用于发现基因网络、脑连通网络和癌症流行病学等结构。我们还比较了传统和机器学习算法在一些基准数据集中的因果发现绩效。基于机器的学习方法,包括深层学习方法,对于传统方法,如可扩展性,包括更多变量,具有许多优势,而且如果有足够的数据,则有可能应用于广泛的生物医学应用,例如遗传学。此外,这些模型比传统模型更为灵活,并有望对未来许多应用产生积极影响。