Comparing competing mathematical models of complex natural processes is a shared goal among many branches of science. The Bayesian probabilistic framework offers a principled way to perform model comparison and extract useful metrics for guiding decisions. However, many interesting models are intractable with standard Bayesian methods, as they lack a closed-form likelihood function or the likelihood is computationally too expensive to evaluate. With this work, we propose a novel method for performing Bayesian model comparison using specialized deep learning architectures. Our method is purely simulation-based and circumvents the step of explicitly fitting all alternative models under consideration to each observed dataset. Moreover, it requires no hand-crafted summary statistics of the data and is designed to amortize the cost of simulation over multiple models and observable datasets. This makes the method particularly effective in scenarios where model fit needs to be assessed for a large number of datasets, so that per-dataset inference is practically infeasible.Finally, we propose a novel way to measure epistemic uncertainty in model comparison problems. We demonstrate the utility of our method on toy examples and simulated data from non-trivial models from cognitive science and single-cell neuroscience. We show that our method achieves excellent results in terms of accuracy, calibration, and efficiency across the examples considered in this work. We argue that our framework can enhance and enrich model-based analysis and inference in many fields dealing with computational models of natural processes. We further argue that the proposed measure of epistemic uncertainty provides a unique proxy to quantify absolute evidence even in a framework which assumes that the true data-generating model is within a finite set of candidate models.
翻译:比较复杂的自然过程的相互竞争的数学模型是许多科学分支的一个共同目标。 巴伊西亚概率框架提供了一种原则性的方法,用于进行模型比较和提取用于指导决策的有用指标。然而,许多有趣的模型与标准的巴伊西亚方法不相符合,因为它们缺乏封闭式概率功能,或者在计算上可能太昂贵,难以评估。有了这项工作,我们提出了使用专门的深层次学习结构进行巴伊西亚模型比较的新颖方法。我们的方法纯粹是模拟为基础的,绕过了所有被观察数据集所考虑的绝对模型的明确匹配步骤。此外,它不需要对数据进行手动的简要统计,而是用来对多个模型和可观测数据集的模拟成本进行摊合。这种方法在模型适合大量数据集评估的情景中特别有效,因此,使用专门的深层学习结构推理算,我们提出的模型的精确度框架可以进一步测量模型比较问题中的不确定性。我们展示了方法的实用性实例和模拟数据来自非三角模型和可观测的数据集中,我们从精确性模型和精确性模型中展示了我们所研究的精确度分析结果。我们从一个在精确度模型和精确度模型中,我们所研究的精确度分析的精确度模型中的精确度模型中,我们从一个在精确度模型中可以评估的精确度模型中可以评估的模型和精确度分析结果中,我们从一个在精确性模型的模型的模型中推导算出一个在精确性模型中可以推算出一个精确性模型中推算。