Precise motion planning and control require accurate models which are often difficult, expensive, or time-consuming to obtain. Online model learning is an attractive approach that can handle model variations while achieving the desired level of performance. However, most model learning methods developed within adaptive nonlinear control are limited to certain types of uncertainties, called matched uncertainties. This work presents an adaptive control framework for nonlinear systems with unmatched uncertainties that addresses several of the limitations of existing methods through two key innovations. The first is leveraging contraction theory and a new type of contraction metric that, when coupled with an adaptation law, is able to track feasible trajectories generated by an adapting reference model. The second is a modulation of the learning rate so the closed-loop system remains stable during learning transients. The proposed approach is more general than existing methods as it is able to handle unmatched uncertainties while only requiring the system be nominally contracting in closed-loop. Additionally, it can be used with learned feedback policies that are known to be contracting in some metric, facilitating transfer learning and bridging the sim2real gap. Simulation results demonstrate the effectiveness of the method.
翻译:精确的动作规划和控制需要精确的模型,这些模型往往困难、昂贵或耗费时间才能获得。在线模型学习是一种有吸引力的方法,既能处理模型变异,又能达到理想的绩效水平。然而,在适应性非线性控制下开发的大多数示范学习方法仅限于某些类型的不确定性,称之为匹配的不确定性。这项工作为非线性系统提供了一个适应性控制框架,这些系统具有不匹配的不确定性,通过两项关键创新解决了现有方法的若干局限性。第一个是利用收缩理论和一种新的收缩度标准,在适应性法律的配合下,能够跟踪适应性参考模型产生的可行的轨迹。第二个是调整学习率,这样闭环系统在学习中保持稳定。拟议方法比现有方法更为笼统,因为它能够处理不匹配的不确定性,而只是要求系统在名义上以闭环方式订约。此外,还可以利用学习到的反馈政策,这种政策在某种指标中已知是承包的,有助于转让学习和弥合模版差距。模拟结果显示方法的有效性。