通过斯托卡反应网络的斯托卡式最佳控制,通过斯托卡式反应网络的斯托卡式最佳控制进行高效重要性抽样 (Efficient Importance Sampling via Stochastic Optimal Control for Stochastic Reaction Networks)

We explore the efficient estimation of statistical quantities, particularly rare event probabilities, for stochastic reaction networks. We propose a novel importance sampling (IS) approach to improve the efficiency of Monte Carlo (MC) estimators when based on an approximate tau-leap scheme. In the IS framework, it is crucial to choose an appropriate change of probability measure for achieving substantial variance reduction. Based on an original connection between finding the optimal IS parameters within a class of probability measures and a stochastic optimal control (SOC) formulation, we propose an automated approach to obtain a highly efficient path-dependent measure change. The optimal IS parameters are obtained by solving a variance minimization problem. We derive an associated backward equation solved by these optimal parameters. Given the challenge of analytically solving this backward equation, we propose a numerical dynamic programming algorithm to approximate the optimal control parameters. To mitigate the curse of dimensionality issue caused by solving the backward equation in the multi-dimensional case, we propose a learning-based method that approximates the value function using a neural network, the parameters of which are determined via stochastic optimization. Our numerical experiments show that our learning-based IS approach substantially reduces the variance of the MC estimator. Moreover, when applying the numerical dynamic programming approach for the one-dimensional case, we obtained a variance that decays at a rate of $\mathcal{O}(\Delta t)$ for a step size of $\Delta t$, compared to $\mathcal{O}(1)$ for a standard MC estimator. For a given prescribed error tolerance, $\text{TOL}$, this implies an improvement in the computational complexity to become $\mathcal{O}(\text{TOL}^{-2})$ instead of $\mathcal{O}(\text{TOL}^{-3})$ when using a standard MC estimator.

翻译：我们探索对统计量的高效估算,特别是{稀有事件概率 { 随机反应网络 { 的统计量 { 稀有事件 { 随机反应网络。我们提议一种自动化方法, 以高效路径为根据的测量 } 来提高蒙特卡洛( MC) 估测器的效率。在大约tau- leep 方案框架内, 选择适当的概率度量改变以大幅减少差异至关重要。根据在概率计量的类别中找到最佳的IS 参数与随机最佳控制( SOC) 配制, 我们提议一种自动方法, 以获得高效路径为主的测量度变化 } 。最佳IS 参数是通过解决差异最小化问题获得的。我们从这些最佳参数中得出了一个相关的后向方公式。鉴于分析解决这个后方程式的挑战, 我们提出一个数字动态程序算算法, 也就是在以美元- 美元( 美元 ) 的轨算法中, 我们用一个数字变数法来减少一个以。