In this paper, discrete linear quadratic regulator (DLQR) and iterative linear quadratic regulator (ILQR) methods based on high-order Runge-Kutta (RK) discretization are proposed for solving linear and nonlinear quadratic optimal control problems respectively. As discovered in [W. Hager, Runge-Kutta method in optimal control and the discrete adjoint system, Numer. Math.,2000, pp. 247-282], direct approach with RK discretization is equivalent with indirect approach based on symplectic partitioned Runge-Kutta (SPRK) integration. In this paper, we will reconstruct this equivalence by the analogue of continuous and discrete dynamic programming. Then, based on the equivalence, we discuss the issue that the internal-stage controls produced by direct approach may have lower order accuracy than the RK method used. We propose order conditions for internal-stage controls and then demonstrate that third or fourth order explicit RK discretization cannot avoid the order reduction phenomenon. To overcome this obstacle, we calculate node control instead of internal-stage controls in DLQR and ILQR methods. And numerical examples will illustrate the validity of our methods. Another advantage of our methods is high computational efficiency which comes from the usage of feedback technique. In this paper, we also demonstrate that ILQR is essentially a quasi-Newton method with linear convergence rate.
翻译:在本文中,基于高阶龙格-库塔(RK)的离散线性线性二次调控器(DLQR)和迭代线性二次调控器(ILQR)方法(ILQR)建议分别用于解决线性和非线性二次优化控制问题。正如在[W. Hager, 龙格-库塔最佳控制和离散连接系统(Numer. Math., 2000, pp. 247-282)中发现的那样,离散的离散离散直接方法与基于间隔绝的龙格-库塔(SPRK)整合的间接方法(ILQR)方法相同。在本文中,我们将通过连续和离散动态的动态编程程序模拟来重建这种等同。然后,在等同的基础上,我们讨论一个问题,即直接方法产生的内部控制阶段控制与离散连接系统(Numeral-L)系统之间的定序控制方法可能比使用的更低顺序精度。 我们的IL-R 和IQ的直线性计算方法将用来说明我们IL 的精确的精确方法。