A majority of microrobots are constructed using compliant materials that are difficult to model analytically, limiting the utility of traditional model-based controllers. Challenges in data collection on microrobots and large errors between simulated models and real robots make current model-based learning and sim-to-real transfer methods difficult to apply. We propose a novel framework residual model learning (RML) that leverages approximate models to substantially reduce the sample complexity associated with learning an accurate robot model. We show that using RML, we can learn a model of the Harvard Ambulatory MicroRobot (HAMR) using just 12 seconds of passively collected interaction data. The learned model is accurate enough to be leveraged as "proxy-simulator" for learning walking and turning behaviors using model-free reinforcement learning algorithms. RML provides a general framework for learning from extremely small amounts of interaction data, and our experiments with HAMR clearly demonstrate that RML substantially outperforms existing techniques.
翻译:多数微型机器人是使用难以进行分析模拟的符合标准材料建造的,这限制了传统模型控制器的效用。在收集微机器人数据方面存在挑战,模拟模型与实际机器人之间也存在大错误,因此难以应用当前基于模型的学习和模拟到真实传输方法。我们提议建立一个新颖的框架剩余模型学习(RML),利用大约的模型来大幅降低与学习精确机器人模型有关的样本复杂性。我们显示,使用RML,我们就可以仅仅使用12秒被动收集的互动数据来学习哈佛气压微机器人(HAMR)的模型。所学的模型十分准确,足以用作“代理模拟器”学习行走和转换行为,使用无模型强化学习算法。RML为从极小的互动数据中学习提供了一个总体框架,我们与HMR的实验清楚地表明,RML大大超越了现有技术。