There has been a recent interest in imitation learning methods that are guaranteed to produce a stabilizing control law with respect to a known system. Work in this area has generally considered linear systems and controllers, for which stabilizing imitation learning takes the form of a biconvex optimization problem. In this paper it is demonstrated that the same methods developed for linear systems and controllers can be readily extended to polynomial systems and controllers using sum of squares techniques. A projected gradient descent algorithm and an alternating direction method of multipliers algorithm are proposed as heuristics for solving the stabilizing imitation learning problem, and their performance is illustrated through numerical experiments.
翻译:最近有人对模仿学习方法感兴趣,这些方法保证对已知的系统产生稳定控制法,这一领域的工作一般考虑到线性系统和控制器,对于这些系统和控制器,稳定模仿学习的形式是双子座体优化问题;本文件表明,为线性系统和控制器开发的相同方法可以很容易地推广到多球系系统和控制器,使用平方体技术的和和数法。 预测的梯度下移算法和乘数算法的交替方向法被提议为解决稳定模仿学习问题的惯性方法,其性能通过数字实验加以说明。