Approximate Bayesian Computation (ABC) now serves as one of the major strategies to perform model choice and parameter inference on models with intractable likelihoods. An essential component of ABC involves comparing a large amount of simulated data with the observed data through summary statistics. To avoid the curse of dimensionality, summary statistic selection is of prime importance, and becomes even more critical when applying ABC to mechanistic network models. Indeed, while many summary statistics can be used to encode network structures, their computational complexity can be highly variable. For large networks, computation of summary statistics can quickly create a bottleneck, making the use of ABC difficult. To reduce this computational burden and make the analysis of mechanistic network models more practical, we investigated two questions in a model choice framework. First, we studied the utility of cost-based filter selection methods to account for different summary costs during the selection process. Second, we performed selection using networks generated with a smaller number of nodes to reduce the time required for the selection step. Our findings show that computationally inexpensive summary statistics can be efficiently selected with minimal impact on classification accuracy. Furthermore, we found that networks with a smaller number of nodes can only be employed to eliminate a moderate number of summaries. While this latter finding is network specific, the former is general and can be adapted to any ABC application.
翻译:近似Bayesian Computation(ABC)目前是执行模型选择和参数推断的主要战略之一。ABC的一个基本组成部分是通过摘要统计将大量模拟数据与观察到的数据进行比较。为了避免维度的诅咒,简要统计选择至关重要,在对机械网络模型应用ABC时甚至更加关键。事实上,许多摘要统计可以用来对网络结构进行编码,它们的计算复杂性可以变化很大。对于大型网络来说,计算摘要统计可以迅速造成瓶颈,使ABC难以使用。为了减少这一计算负担,并使机械网络模型的分析更加实用,我们在模型选择框架内调查了两个问题。首先,我们研究了基于成本的过滤选择方法的效用,以计算选择过程的不同摘要成本。第二,我们利用较少的节点生成的网络进行选择,以减少选择步骤所需的时间。对于大型网络来说,我们的调查结果表明,计算成本低廉的简要统计数据可以快速选择,对分类准确性影响最小。此外,我们发现,在模型选择之前,使用过的任何具体网络,只有使用过一个小的网络,而后,才能找到一个小的网络,而使用过一个小的缩的BC摘要。