Real-time adaptation is imperative to the control of robots operating in complex, dynamic environments. Adaptive control laws can endow even nonlinear systems with good trajectory tracking performance, provided that any uncertain dynamics terms are linearly parameterizable with known nonlinear features. However, it is often difficult to specify such features a priori, such as for aerodynamic disturbances on rotorcraft or interaction forces between a manipulator arm and various objects. In this paper, we turn to data-driven modeling with neural networks to learn, offline from past data, an adaptive controller with an internal parametric model of these nonlinear features. Our key insight is that we can better prepare the controller for deployment with control-oriented meta-learning of features in closed-loop simulation, rather than regression-oriented meta-learning of features to fit input-output data. Specifically, we meta-learn the adaptive controller with closed-loop tracking simulation as the base-learner and the average tracking error as the meta-objective. With a nonlinear planar rotorcraft subject to wind, we demonstrate that our adaptive controller outperforms other controllers trained with regression-oriented meta-learning when deployed in closed-loop for trajectory tracking control.
翻译:实时适应对于控制在复杂、动态环境中运行的机器人至关重要。 适应性控制法甚至可以使非线性系统具有良好的轨迹跟踪性能,只要任何不确定的动态术语都是线性参数,可以使用已知的非线性特征进行线性参数。 但是,通常很难先验地指定此类特征,例如转动器中的空气动力干扰或操纵器臂和各种物体之间的相互作用。 在本文件中,我们转向由神经网络驱动的数据驱动模型,以便从过去的数据中脱线地学习具有这些非线性特征的内部参数模型的适应性控制器。 我们的主要洞察力是,我们可以更好地为控制器部署以控制为主的闭路模拟功能,而不是以回归为主的元数据。 具体地说,我们用闭路跟踪模拟将适应性控制器作为基 Learner 和平均跟踪错误作为元目标。 我们用非线性计划性控制器作为风向的非线性循环模型,我们证明我们的适应性控制器在部署的轨迹上超越了其他以回归式控制器的轨迹。