We present a new method for learning control law that stabilizes an unknown nonlinear dynamical system at an equilibrium point. We formulate a system identification task in a self-supervised learning setting that jointly learns a controller and corresponding stable closed-loop dynamics hypothesis. The input-output behavior of the unknown dynamical system under random control inputs is used as the supervising signal to train the neural network-based system model and the controller. The proposed method relies on the Lyapunov stability theory to generate a stable closed-loop dynamics hypothesis and corresponding control law. We demonstrate our method on various nonlinear control problems such as n-link pendulum balancing and trajectory tracking, pendulum on cart balancing, and wheeled vehicle path following.
翻译:我们提出了一个新的学习控制法方法,在平衡点稳定一个未知的非线性动态系统。我们在一个自我监督的学习环境中制定系统识别任务,共同学习一个控制器和相应的稳定闭路动态假设。在随机控制输入下,未知动态系统的投入-输出行为被用作监督信号,用于培训神经网络系统模型和控制器。拟议方法依靠Lyapunov稳定性理论来产生一个稳定的闭路动态假设和相应的控制法。我们展示了我们处理各种非线性控制问题的方法,如N-链接的钟点平衡和轨迹跟踪、车盘平衡和车轮路跟踪。