We consider the problem of computing the maximal probability of satisfying an omega-regular specification for stochastic nonlinear systems evolving in discrete time. The problem reduces, after automata-theoretic constructions, to finding the maximal probability of satisfying a parity condition on a (possibly hybrid) state space. While characterizing the exact satisfaction probability is open, we show that a lower bound on this probability can be obtained by (I) computing an under-approximation of the qualitative winning region, i.e., states from which the parity condition can be enforced almost surely, and (II) computing the maximal probability of reaching this qualitative winning region. The heart of our approach is a technique to symbolically compute the under-approximation of the qualitative winning region in step (I) via a finite-state abstraction of the original system as a 2.5-player parity game. Our abstraction procedure uses only the support of the probabilistic evolution; it does not use precise numerical transition probabilities. We prove that the winning set in the abstract 2.5-player game induces an under-approximation of the qualitative winning region in the original synthesis problem, along with a policy to solve it. By combining these contributions with (a) a symbolic fixpoint algorithm to solve 2.5-player games and (b) existing techniques for reachability policy synthesis in stochastic nonlinear systems, we get an abstraction-based algorithm for finding a lower bound on the maximal satisfaction probability. We have implemented the abstraction-based algorithm in Mascot-SDS (Majumdar et al., 2020), where we combined the outlined abstraction step with our recent tool FairSyn. We evaluated our implementation on the nonlinear model of a perturbed bistable switch from the literature. We outperform a recently proposed tool for solving this problem by a large margin.
翻译:我们考虑的是计算满足离散时间演变的非直线性系统抽象算法规则最高概率的问题。 问题在于, 在自动算法理论构造后, 如何在( 可能的混合) 状态空间中找到满足对等条件的最大概率。 虽然精确的满意度概率的特征是开放的, 我们显示, 可以通过( I) 计算一个质量赢取区域不完全匹配的偏差值, 也就是说, 能够几乎肯定地执行对等性条件, 并且( II) 计算达到这个定性赢利区域的最大概率。 我们的方法的核心是, 以象征方式将原系统低于对等的对等性条件进行对等性测试的最大概率。 我们的抽象程序只使用模型变异性的支持, 也就是说, 并不使用精确的数字转换概率。 我们证明, 在抽象的 2.5 游戏中的非最大概率概率, 与正压的对等值的对等值的对等值的对等值 。 ( 通过将原始的对正值的对正值的对等的对正值 ), 将我们原始的对正值的对正值的对正值的对正值 。