We consider Markov decision processes with synchronizing objectives, which require that a probability mass of $1-\epsilon$ accumulates in a designated set of target states, either once, always, infinitely often, or always from some point on, where $\epsilon = 0$ for sure synchronizing, and $\epsilon \to 0$ for almost-sure and limit-sure synchronizing. We introduce two new qualitative modes of synchronizing, where the probability mass should be either positive, or bounded away from $0$. They can be viewed as dual synchronizing objectives. We present algorithms and tight complexity results for the problem of deciding if a Markov decision process is positive, or bounded synchronizing, and we provide explicit bounds on $\epsilon$ in all synchronizing modes. In particular, we show that deciding positive and bounded synchronizing always from some point on, is coNP-complete.
翻译:我们考虑Markov决策程序与同步目标同步,这要求1美元-欧元的概率质量在指定的一组目标状态中积累一次、一次、一次、无限期地、或总是从某个点,即美元=0美元肯定同步化,和美元=0美元至0美元几乎保证和有限度同步化。我们引入了两种新的同步质量模式,即概率质量要么是正数,要么是约束在$0之外。它们可以被视为双重同步目标。我们提出了算法和严格复杂的结果,以决定Markov决定程序是否正或捆绑同步化的问题,我们提供了在所有同步模式中对美元的明确约束。特别是,我们表明决定正数和捆绑的同步总是从某个点开始是正数,是完整的。