Motivated by applications to the dynamic control of queueing networks, we develop a simulation-based scheme, the so-called multilevel Picard (MLP) approximation, for solving high-dimensional drift control problems whose states are constrained to stay within the nonnegative orthant, over a finite time horizon. We prove that under suitable conditions, the MLP approximation overcomes the curse of dimensionality in the following sense: To approximate the value function and its gradient evaluated at a given time and state to within a prescribed accuracy $\varepsilon$, the computational complexity grows at most polynomially in the problem dimension $d$ and $1/\varepsilon$. To illustrate the effectiveness of the scheme, we carry out numerical experiments for a class of test problems that are related to the dynamic scheduling problem of parallel server systems in heavy traffic, and demonstrate that the scheme is computationally feasible up to dimension at least $20$.
翻译:受排队网络动态控制应用的启发,我们开发了一种基于模拟的求解方案——即所谓多层皮卡德(MLP)近似法——用于求解高维漂移控制问题,其状态在有限时间范围内被约束保持在非负象限内。我们证明,在适当条件下,MLP近似法能以如下方式克服维度灾难:为在给定时间和状态下将值函数及其梯度近似至预设精度$\varepsilon$,其计算复杂度至多以问题维度$d$和$1/\varepsilon$的多项式形式增长。为说明该方案的有效性,我们对一类与重负载下并行服务器系统动态调度问题相关的测试问题进行了数值实验,并证明该方案至少在维度20以内具有计算可行性。