通过均等运动会对存储系统进行象征性控制 (Symbolic Control for Stochastic Systems via Parity Games)

We consider the problem of computing the maximal probability of satisfying an $\omega$-regular specification for stochastic, continuous-state, nonlinear systems evolving in discrete time. The problem reduces, after automata-theoretic constructions, to finding the maximal probability of satisfying a parity condition on a (possibly hybrid) state space. While characterizing the exact satisfaction probability is open, we show that a lower bound on this probability can be obtained by (I) computing an under-approximation of the qualitative winning region, i.e., states from which the parity condition can be enforced almost surely, and (II) computing the maximal probability of reaching this qualitative winning region. The heart of our approach is a technique to symbolically compute the under-approximation of the qualitative winning region in step (I) via a finite-state abstraction of the original system as a $2\frac{1}{2}$-player parity game. Our abstraction procedure uses only the support of the probabilistic evolution; it does not use precise numerical transition probabilities. We prove that the winning set in the abstract $2\frac{1}{2}$-player game induces an under-approximation of the qualitative winning region in the original synthesis problem, along with a policy to solve it. By combining these contributions with (a) existing symbolic fixpoint algorithms to solve $2\frac{1}{2}$-player games and (b) existing techniques for reachability policy synthesis in stochastic nonlinear systems, we get an abstraction-based symbolic algorithm for finding a lower bound on the maximal satisfaction probability. We have implemented our approach and evaluated it on the nonlinear model of the perturbed Dubins vehicle.

翻译：我们考虑的是计算满足美元=oomega2 常规规则的最大概率的问题。在不连续的时间里, 持续状态和非线性系统在不连续的时间里演变。在自动化理论构造后, 问题会降低在( 可能混合的) 状态空间中找到满足对等条件的最大概率。虽然精确满意度概率的特征是开放的, 我们显示, 计算质量赢取区域低于对等值的方法可以降低这一概率的约束值, 即, 可以几乎肯定地执行对等条件的国家, 并且( II) 计算达到这个质量赢取区域的最大概率。我们的方法的核心是象征性地将质量赢取区域低于对等( I), 将原始系统定点抽象地抽象地计算为 2\ frac { 1\\\\\ 美元平价游戏。我们的解决程序只能使用对等值进化方法的支持; 它不会使用精确的数值过渡性非正值的精确性策略。我们的策略的核心是, 正在以正向的正向的平价化区域。