In this work, we consider the task of improving the accuracy of dynamic models for model predictive control (MPC) in an online setting. Even though prediction models can be learned and applied to model-based controllers, these models are often learned offline. In this offline setting, training data is first collected and a prediction model is learned through an elaborated training procedure. After the model is trained to a desired accuracy, it is then deployed in a model predictive controller. However, since the model is learned offline, it does not adapt to disturbances or model errors observed during deployment. To improve the adaptiveness of the model and the controller, we propose an online dynamics learning framework that continually improves the accuracy of the dynamic model during deployment. We adopt knowledge-based neural ordinary differential equations (KNODE) as the dynamic models, and use techniques inspired by transfer learning to continually improve the model accuracy. We demonstrate the efficacy of our framework with a quadrotor robot, and verify the framework in both simulations and physical experiments. Results show that the proposed approach is able to account for disturbances that are possibly time-varying, while maintaining good trajectory tracking performance.
翻译:在这项工作中,我们考虑在网上环境中提高模型预测控制动态模型(MPC)准确性的任务。尽管可以学习预测模型并将其应用于基于模型的控制器,但这些模型往往是离线学习的。在这一离线设置中,首先收集培训数据,然后通过精心制定的培训程序学习预测模型。在模型经过预期准确性的培训后,然后将其安装在模型预测控制器中。然而,由于模型是离线学习的,它不能适应在部署期间观察到的扰动或模型错误。为了改进模型和控制器的适应性,我们提议了一个在线动态学习框架,不断提高动态模型的准确性。我们采用了基于知识的神经普通差异方程式(KODE)作为动态模型,并使用转移学习所启发的技术来不断提高模型的准确性。我们用一个 quadrtortoror机器人展示了我们框架的功效,并在模拟和物理实验中验证了框架。结果显示,拟议的方法能够对可能发生时间变化的干扰进行核算,同时保持良好的轨迹跟踪业绩。