Bayesian model comparison (BMC) offers a principled approach for assessing the relative merits of competing computational models and propagating uncertainty into model selection decisions. However, BMC is often intractable for the popular class of hierarchical models due to their high-dimensional nested parameter structure. To address this intractability, we propose a deep learning method for performing BMC on any set of hierarchical models which can be instantiated as probabilistic programs. Since our method enables amortized inference, it allows efficient re-estimation of posterior model probabilities and fast performance validation prior to any real-data application. In a series of extensive validation studies, we benchmark the performance of our method against the state-of-the-art bridge sampling method and demonstrate excellent amortized inference across all BMC settings. We then use our method to compare four hierarchical evidence accumulation models that have previously been deemed intractable for BMC due to partly implicit likelihoods. In this application, we corroborate evidence for the recently proposed L\'evy flight model of decision-making and show how transfer learning can be leveraged to enhance training efficiency. Reproducible code for all analyses is provided.
翻译:贝叶西亚模型比较(BMC)为评估相互竞争的计算模型的相对优点和在模型选择决定中宣传不确定性提供了一个原则性方法,然而,BMC由于其高维嵌套参数结构,往往难以为流行的等级模型类别所利用。为解决这一可忽略不计的问题,我们提议了一种深层学习方法,用以在可以即时作为概率程序的任何一套等级模型上进行BMC。由于我们的方法能够进行分解推断,因此在应用任何真实数据之前,可以有效地重新估计后生模型的概率和快速性能验证。在一系列广泛的验证研究中,我们根据最先进的桥梁取样方法衡量我们方法的性能,并展示出所有BMC环境的极佳折价推法。我们然后使用我们的方法比较四个以前被认为由于部分隐含的可能性而对BMC来说难以使用的等级证据积累模型。在这个应用中,我们证实了最近提议的L\'evy飞行模型在决策中应用的证据,并表明如何利用转移学习来提高培训效率。