Model-based Reinforcement Learning and Control have demonstrated great potential in various sequential decision making problem domains, including in robotics settings. However, real-world robotics systems often present challenges that limit the applicability of those methods. In particular, we note two problems that jointly happen in many industrial systems: 1) Irregular/asynchronous observations and actions and 2) Dramatic changes in environment dynamics from an episode to another (e.g. varying payload inertial properties). We propose a general framework that overcomes those difficulties by meta-learning adaptive dynamics models for continuous-time prediction and control. We present evaluations in two different robot simulations and on a real industrial robot.
翻译:以模型为基础的强化学习和控制在一系列相继决策问题领域,包括在机器人环境中,显示出巨大的潜力,然而,现实世界机器人系统往往带来挑战,限制了这些方法的适用性,特别是,我们注意到许多工业系统共同出现的两个问题:(1) 不规则/不同步的观测和行动;(2) 环境动态从一个事件到另一个事件发生巨变(例如,不同有效载荷惯性特性);我们提出了一个总体框架,通过元学习适应性动态模型来持续预测和控制,克服这些困难。我们用两种不同的机器人模拟和一种真正的工业机器人来进行评估。