Linear dynamical systems are canonical models for learning-based control of plants with uncertain dynamics. The setting consists of a stochastic differential equation that captures the state evolution of the plant understudy, while the true dynamics matrices are unknown and need to be learned from the observed data of state trajectory. An important issue is to ensure that the system is stabilized and destabilizing control actions due to model uncertainties are precluded as soon as possible. A reliable stabilization procedure for this purpose that can effectively learn from unstable data to stabilize the system in a finite time is not currently available. In this work, we propose a novel Bayesian learning algorithm that stabilizes unknown continuous-time stochastic linear systems. The presented algorithm is flexible and exposes effective stabilization performance after a remarkably short time period of interacting with the system.
翻译:线性动态系统是以学习为基础控制动态不确定的植物的典型模式。设置包括一种随机差异方程式,它捕捉到厂底研究的状态演变,而真正的动态矩阵则未知,需要从观察到的状态轨迹数据中学习。一个重要问题是确保系统稳定下来,并尽快排除模型不确定性造成的不稳定控制行动。目前没有为此目的建立可靠的稳定程序,从不稳定数据中有效学习,以在有限的时间内稳定系统。在这个工作中,我们提出一个新颖的巴耶斯学习算法,以稳定未知的连续时间随机线性系统。所提出的算法是灵活的,在与系统互动的时间非常短之后暴露有效的稳定性表现。