Controllers for dynamical systems that operate in safety-critical settings must account for stochastic disturbances. Such disturbances are often modeled as process noise in a dynamical system, and common assumptions are that the underlying distributions are known and/or Gaussian. In practice, however, these assumptions may be unrealistic and can lead to poor approximations of the true noise distribution. We present a novel controller synthesis method that does not rely on any explicit representation of the noise distributions. In particular, we address the problem of computing a controller that provides probabilistic guarantees on safely reaching a target, while also avoiding unsafe regions of the state space. First, we abstract the continuous control system into a finite-state model that captures noise by probabilistic transitions between discrete states. As a key contribution, we adapt tools from the scenario approach to compute probably approximately correct (PAC) bounds on these transition probabilities, based on a finite number of samples of the noise. We capture these bounds in the transition probability intervals of a so-called interval Markov decision process (iMDP). This iMDP is, with a user-specified confidence probability, robust against uncertainty in the transition probabilities, and the tightness of the probability intervals can be controlled through the number of samples. We use state-of-the-art verification techniques to provide guarantees on the iMDP and compute a controller for which these guarantees carry over to the original control system. In addition, we develop a tailored computational scheme that reduces the complexity of the synthesis of these guarantees on the iMDP. Benchmarks on realistic control systems show the practical applicability of our method, even when the iMDP has hundreds of millions of transitions.
翻译:在安全临界环境下运行的动态系统控制器的主计长必须说明在安全临界环境下运行的干扰。这种扰动通常以动态系统中的流程噪音为模型,通常的假设是基本分布为已知和(或)高森。 但是,在实践中,这些假设可能是不现实的,可能导致真实噪音分布的近似性差。我们提出了一个新的控制器合成方法,它并不依赖于噪音分布的任何明确表示。特别是,我们处理计算一个控制器的问题,该控制器为安全到达目标提供概率性保障,同时避免州空间不安全的区域。首先,我们将持续控制系统转化为一个固定状态模型,通过离散州之间的概率转换捕捉到噪音。作为关键的贡献,我们根据噪音分布的样本数量有限,根据这些过渡概率,我们甚至从所谓的马可(iMDP)决定过程的过渡概率间隔中捕捉到这些约束。这种iMDP的连续性控制系统,随着用户对精确度的过渡的概率的概率的精确度的精确度的精确度的精确度的精确度而降低。