Model-based Reinforcement Learning and Control have demonstrated great potential in various sequential decision making problem domains, including in robotics settings. However, real-world robotics systems often present challenges that limit the applicability of those methods. In particular, we note two problems that jointly happen in many industrial systems: 1) Irregular/asynchronous observations and actions and 2) Dramatic changes in environment dynamics from an episode to another (e.g. varying payload inertial properties). We propose a general framework that overcomes those difficulties by meta-learning adaptive dynamics models for continuous-time prediction and control. We evaluate the proposed approach on two robotic simulations, the first of which is inspired by a real-world industrial robot.
翻译:以模型为基础的强化学习和控制在一系列连续决策问题领域,包括在机器人环境中,显示出巨大的潜力,然而,现实世界机器人系统往往带来挑战,限制了这些方法的适用性,特别是,我们注意到许多工业系统共同出现的两个问题:(1) 不定期/不同步的观察和行动;(2) 环境动态从一个事件到另一个事件发生巨变(例如不同有效载荷惯性特性)。我们提出了一个总框架,通过元学习适应性动态模型来持续预测和控制,克服这些困难。我们评估了两种机器人模拟的拟议方法,其中第一种是真实世界的工业机器人所启发的。