We consider an LQR optimal control problem with partially unknown dynamics. We propose a new model-based online algorithm to obtain an approximation of the dynamics $and$ the control at the same time during a single simulation.
翻译:我们认为LQR最佳控制问题存在部分未知的动态。 我们提出一个新的基于模型的在线算法,以便在单一模拟期间同时获得动态近似美元和美元的控制。