Providing non-trivial certificates of safety for non-linear stochastic systems is an important open problem that limits the wider adoption of autonomous systems in safety-critical applications. One promising solution to address this problem is barrier functions. The composition of a barrier function with a stochastic system forms a supermartingale, thus enabling the computation of the probability that the system stays in a safe set over a finite time horizon via martingale inequalities. However, existing approaches to find barrier functions for stochastic systems generally rely on convex optimization programs that restrict the search of a barrier to a small class of functions such as low degree SoS polynomials and can be computationally expensive. In this paper, we parameterize a barrier function as a neural network and show that techniques for robust training of neural networks can be successfully employed to find neural barrier functions. Specifically, we leverage bound propagation techniques to certify that a neural network satisfies the conditions to be a barrier function via linear programming and then employ the resulting bounds at training time to enforce the satisfaction of these conditions. We also present a branch-and-bound scheme that makes the certification framework scalable. We show that our approach outperforms existing methods in several case studies and often returns certificates of safety that are orders of magnitude larger.
翻译:提供非线性随机系统安全非边际安全证书是一个重要的开放问题,它限制在安全关键应用中更广泛地采用自主系统。解决这一问题的一个有希望的解决方案是屏障功能。一个带有随机系统的屏障功能构成形成一个超边线,从而能够通过马丁格尔的不平等来计算系统在一定的时限内在安全设置中停留的可能性。然而,目前找到非线性随机系统屏障功能的方法一般依赖于连接优化程序,这种程序限制了在安全关键应用中广泛采用自主系统的屏障,从而限制了对一小类功能的搜索,如低度 SoS多面体和可计算成本。在本文中,我们将屏障功能作为神经网络的设置参数,并表明对神经网络进行强力培训的技术能够成功地用于找到神经屏障功能。具体地说,我们利用捆绑的传播技术来证明神经网络能够通过线性编程满足屏障功能的条件,然后在培训时使用由此形成的界限来强制执行这些条件的满意度。我们还提出一个分支和宽度计划,使认证框架的验证方法往往具有较大的规模。