We study online control of an unknown nonlinear dynamical system that is approximated by a time-invariant linear system with model misspecification. Our study focuses on robustness, which measures how much deviation from the assumed linear approximation can be tolerated while maintaining a bounded $\ell_2$-gain compared to the optimal control in hindsight. Some models cannot be stabilized even with perfect knowledge of their coefficients: the robustness is limited by the minimal distance between the assumed dynamics and the set of unstabilizable dynamics. Therefore it is necessary to assume a lower bound on this distance. Under this assumption, and with full observation of the $d$ dimensional state, we describe an efficient controller that attains $\Omega(\frac{1}{\sqrt{d}})$ robustness together with an $\ell_2$-gain whose dimension dependence is near optimal. We also give an inefficient algorithm that attains constant robustness independent of the dimension, with a finite but sub-optimal $\ell_2$-gain.
翻译:我们研究一个未知的非线性动态系统的在线控制,这个系统近似于时间变化线性线性系统,其模型特性有误。 我们的研究侧重于稳健度, 测量在保持约束值$_2美元增益与后视最佳控制相比, 能够容忍多少偏离假设线性近似值。 有些模型即使完全了解其系数, 也无法稳定下来: 稳性受假设动态与一组无法稳定动态之间最小距离的限制。 因此, 必须在这条距离上承担一个较低的约束。 在此假设下, 并充分观察美元维值状态, 我们描述一个高效的控制器, 它能达到 $\\ merga (\ frac{ 1\ tunsqrt{d\\\\\\\\\\\\\\\\\\\\\\ 美元) 美元增益, 其尺寸依赖性接近最佳。 我们还给出一个效率低的算法, 它能取得与维度相独立的恒度的稳性强性, 且具有有限但亚优值$_2美元增值。