We consider the problem of controlling a mutated diffusion process with an unknown mutation time. The problem is formulated as the quickest intervention problem with the mutation modeled by a change-point, which is a generalization of the quickest change-point detection (QCD). Our goal is to intervene in the mutated process as soon as possible while maintaining a low intervention cost with optimally chosen intervention actions. This model and the proposed algorithms can be applied to pandemic prevention (such as Covid-19) or misinformation containment. We formulate the problem as a partially observed Markov decision process (POMDP) and convert it to an MDP through the belief state of the change-point. We first propose a grid approximation approach to calculate the optimal intervention policy, whose computational complexity could be very high when the number of grids is large. In order to reduce the computational complexity, we further propose a low-complexity threshold-based policy through the analysis of the first-order approximation of the value functions in the ``local intervention'' regime. Simulation results show the low-complexity algorithm has a similar performance as the grid approximation and both perform much better than the QCD-based algorithms.
翻译:我们考虑的是以未知突变时间控制变异扩散过程的问题。问题被表述为以变化点为模型的突变的最快速干预问题,变异点是快速变化点检测(QCD)的一般化。我们的目标是尽快干预变异过程,同时以最佳选择的干预行动保持低干预成本。这一模型和拟议的算法可以适用于大流行病预防(如Covid-19)或错误信息遏制。我们将问题作为部分观察到的Markov决策过程(POMDP)来表述,并通过变化点的信仰状态将其转换为MDP。我们首先提出一种电网近似法,以计算最佳干预政策,在电网数量庞大时,其计算复杂性可能非常高。为了减少计算复杂性,我们进一步提议采用低兼容性门槛政策,方法是分析“地方干预”制度中价值功能的一阶近似值。模拟结果显示,低兼容性算法与电网近似,其性与QCD的算法都好于Q。