Learning-based controllers, such as neural network (NN) controllers, can show high empirical performance but lack formal safety guarantees. To address this issue, control barrier functions (CBFs) have been applied as a safety filter to monitor and modify the outputs of learning-based controllers in order to guarantee the safety of the closed-loop system. However, such modification can be myopic with unpredictable long-term effects. In this work, we propose a safe-by-construction NN controller which employs differentiable CBF-based safety layers, and investigate the performance of safe-by-construction NN controllers in learning-based control. Specifically, two formulations of controllers are compared: one is projection-based and the other relies on our proposed set-theoretic parameterization. Both methods demonstrate improved closed-loop performance over using CBF as a separate safety filter in numerical experiments.
翻译:以学习为基础的控制器,如神经网络控制器,可以显示较高的实证性能,但缺乏正式的安全保障。为解决这一问题,控制屏障功能(CBFs)被用作安全过滤器,用于监测和修改学习控制器的输出,以保证闭路控制器系统的安全。但是,这种修改可能是短视的,具有无法预测的长期影响。在这项工作中,我们提议一个安全逐部的NNC控制器,使用不同的 CBF 安全层,并调查在基于学习的控制中安全就施工的NNC控制器的性能。具体地说,对控制器的两种配方进行了比较:一种以预测为基础,另一种以我们提议的定置理论参数参数化为依据。两种方法都表明在数字实验中使用CBF作为单独的安全过滤器方面提高了闭路性能。