Safety is critical in autonomous robotic systems. A safe control law ensures forward invariance of a safe set (a subset in the state space). It has been extensively studied regarding how to derive a safe control law with a control-affine analytical dynamic model. However, in complex environments and tasks, it is challenging and time-consuming to obtain a principled analytical model of the system. In these situations, data-driven learning is extensively used and the learned models are encoded in neural networks. How to formally derive a safe control law with Neural Network Dynamic Models (NNDM) remains unclear due to the lack of computationally tractable methods to deal with these black-box functions. In fact, even finding the control that minimizes an objective for NNDM without any safety constraint is still challenging. In this work, we propose MIND-SIS (Mixed Integer for Neural network Dynamic model with Safety Index Synthesis), the first method to derive safe control laws for NNDM. The method includes two parts: 1) SIS: an algorithm for the offline synthesis of the safety index (also called as barrier function), which uses evolutionary methods and 2) MIND: an algorithm for online computation of the optimal and safe control signal, which solves a constrained optimization using a computationally efficient encoding of neural networks. It has been theoretically proved that MIND-SIS guarantees forward invariance and finite convergence. And it has been numerically validated that MIND-SIS achieves safe and optimal control of NNDM. From our experiments, the optimality gap is less than $10^{-8}$, and the safety constraint violation is $0$.
翻译:在自主机器人系统中,安全是安全的。安全控制法确保安全套件(州空间的一个子集)的前方不易发生安全套件(州空间的一个子集),对于如何制定安全控制法进行广泛研究,使用控制室分析动态模型分析模型分析安全套件。然而,在复杂的环境和任务中,获得系统的原则分析模型是具有挑战性和耗时的。在这种情况下,广泛使用数据驱动的学习方法,并将所学模型编码在神经网络网络网络中。如何正式利用神经网络动态模型(NNDM)制定安全套件(NNDM)安全套件(NNDM)的安全控制法(NDM);由于缺乏可计算的方法来处理这些黑箱功能。事实上,即使找到将NDDMD目标降到最低程度而没有任何安全限制的控制措施,在这项工作中,MIND-SIS(NG Integrad Integer)第一个获得安全控制法律编码的方法是:1)SIS:安全指数离线合成的算法(也称为屏障功能),它使用进化方法,从进化方法,而MNDMD(MD)的升级的升级的升级的精确控制系统,它已经得到一个最优化的升级的升级的计算。