Inferring the causal structure of a system typically requires interventional data, rather than just observational data. Since interventional experiments can be costly, it is preferable to select interventions that yield the maximum amount of information about a system. We propose a novel Bayesian method for optimal experimental design by sequentially selecting interventions that minimize the expected posterior entropy as rapidly as possible. A key feature is that the method can be implemented by computing simple summaries of the current posterior, avoiding the computationally burdensome task of repeatedly performing posterior inference on hypothetical future datasets drawn from the posterior predictive. After deriving the method in a general setting, we apply it to the problem of inferring causal networks. We present a series of simulation studies in which we find that the proposed method performs favorably compared to existing alternative methods. Finally, we apply the method to real and simulated data from a protein-signaling network.
翻译:由于干预实验费用高昂,因此最好选择能产生系统最大数量信息的干预措施。我们建议采用一种新型贝叶斯方法,通过按顺序选择干预措施,优化实验设计,尽可能快地将预期的后继星体最小化。一个关键特征是,该方法可以通过计算当前后继星的简单摘要来实施,避免对从后继预测中提取的假设的未来数据集反复进行后继推论的计算繁琐任务。在得出该方法之后,我们将其应用于推断因果网络的问题。我们提出了一系列模拟研究,发现拟议方法与现有的替代方法相比,效果良好。最后,我们将这种方法应用于从蛋白质信号网络获取的真实和模拟数据。