反应网络有条件的蒙特卡洛 (Conditional Monte Carlo for Reaction Networks)

from arxiv, Accepted version to SIAM Journal on Scientific Computing (SISC). Supplementary material included at end. Example Matlab code can be found at https://github.com/kehlert/conditional_monte_carlo_example

Reaction networks are often used to model interacting species in fields such as biochemistry and ecology. When the counts of the species are sufficiently large, the dynamics of their concentrations are typically modeled via a system of differential equations. However, when the counts of some species are small, the dynamics of the counts are typically modeled stochastically via a discrete state, continuous time Markov chain. A key quantity of interest for such models is the probability mass function of the process at some fixed time. Since paths of such models are relatively straightforward to simulate, we can estimate the probabilities by constructing an empirical distribution. However, the support of the distribution is often diffuse across a high-dimensional state space, where the dimension is equal to the number of species. Therefore generating an accurate empirical distribution can come with a large computational cost. We present a new Monte Carlo estimator that fundamentally improves on the "classical" Monte Carlo estimator described above. It also preserves much of classical Monte Carlo's simplicity. The idea is basically one of conditional Monte Carlo. Our conditional Monte Carlo estimator has two parameters, and their choice critically affects the performance of the algorithm. Hence, a key contribution of the present work is that we demonstrate how to approximate optimal values for these parameters in an efficient manner. Moreover, we provide a central limit theorem for our estimator, which leads to approximate confidence intervals for its error.

翻译：在生物化学和生态等领域,常常使用反应网络来模拟相互作用的物种,例如生物化学和生态学。当物种的计数足够大时,其浓度的动态通常是通过差异方程系统模拟的。然而,当某些物种的计数较小时,计数的动态通常是通过离散状态、连续时间马可夫链进行模拟的。这些模型的主要兴趣是过程在某个固定时间段的概率质量功能。由于这些模型的路径对模拟来说相对简单,我们可以通过建立经验分布来估计概率。然而,其分布的动态通常通过一个高维度空间进行模型的模型。当某些物种的计数小时,这些物种的动态通常通过一个分离状态进行模拟的模拟。但是,当某些物种的计数数量,这些物种的动态的动态通常会通过一个与物种数量相等的高度状态空间来模拟。因此,产生准确的经验性分布的动态可以通过一个巨大的计算成本来进行模拟。我们推出一个新的蒙特卡洛的测算仪,它从根本上改进了“古典”蒙特卡洛测算仪的精度功能。这个概念基本上是一个有条件的蒙特卡洛。我们有条件的测算师有两种参数,我们目前的测算的精确的参数,它能影响着一个核心的精度的精度的精度的精确度。