This paper introduces the Generalized Action Governor, which is a supervisory scheme for augmenting a nominal closed-loop system with the capability of strictly handling constraints. After presenting its theory for general systems and introducing tailored design approaches for linear and discrete systems, we discuss its application to safe online learning, which aims to safely evolve control parameters using real-time data to improve performance for uncertain systems. In particular, we propose two safe learning algorithms based on integration of reinforcement learning/data-driven Koopman operator-based control with the generalized action governor. The developments are illustrated with a numerical example.
翻译:本文介绍通用行动总督,这是一个监督计划,旨在以严格处理限制的能力扩大名义上封闭性循环系统,我们在提出一般系统的理论和对线性系统和离散系统采用量身定做的设计方法之后,讨论了其对安全在线学习的应用,其目的是利用实时数据安全地发展控制参数,以改善不确定系统的性能,特别是,我们提议两种安全学习算法,其基础是把强化学习/数据驱动的库普曼操作员控制与通用行动州长相结合。