具有高斯过程的概率性强稳健的线性象形调节器 (Probabilistic robust linear quadratic regulators with Gaussian processes)

Probabilistic models such as Gaussian processes (GPs) are powerful tools to learn unknown dynamical systems from data for subsequent use in control design. While learning-based control has the potential to yield superior performance in demanding applications, robustness to uncertainty remains an important challenge. Since Bayesian methods quantify uncertainty of the learning results, it is natural to incorporate these uncertainties into a robust design. In contrast to most state-of-the-art approaches that consider worst-case estimates, we leverage the learning method's posterior distribution in the controller synthesis. The result is a more informed and, thus, more efficient trade-off between performance and robustness. We present a novel controller synthesis for linearized GP dynamics that yields robust controllers with respect to a probabilistic stability margin. The formulation is based on a recently proposed algorithm for linear quadratic control synthesis, which we extend by giving probabilistic robustness guarantees in the form of credibility bounds for the system's stability.Comparisons to existing methods based on worst-case and certainty-equivalence designs reveal superior performance and robustness properties of the proposed method.

翻译：Gaussian 进程(GPs)等概率模型是从数据中学习未知动态系统的有力工具,供随后用于控制设计。虽然基于学习的控制有可能在要求应用程序方面产生优异的性能,但稳健的不确定性仍然是一个重大挑战。由于Bayesian方法量化了学习结果的不确定性,因此将这些不确定性纳入一个稳健的设计中是很自然的。与大多数考虑最坏情况估计的最先进的方法相比,我们利用了控制器合成中学习方法的后方分布。结果是更加知情,从而在性能和稳健之间实现更有效的平衡。我们为线性化的GP动态提供了一种新的控制器合成,在概率稳定幅度方面产生稳健的控制器。这种配方基于最近提出的线性二次曲线控制合成算法,我们通过以系统稳定性可信度界限的形式提供概率稳健的保证来扩展这一配方。对基于最坏情况和确定性等值设计的现有方法的匹配性,显示了拟议方法的优性能和稳健性。