持续机器人学习的行为自组织支助任务推论 (Behavior Self-Organization Supports Task Inference for Continual Robot Learning)

Recent advances in robot learning have enabled robots to become increasingly better at mastering a predefined set of tasks. On the other hand, as humans, we have the ability to learn a growing set of tasks over our lifetime. Continual robot learning is an emerging research direction with the goal of endowing robots with this ability. In order to learn new tasks over time, the robot first needs to infer the task at hand. Task inference, however, has received little attention in the multi-task learning literature. In this paper, we propose a novel approach to continual learning of robotic control tasks. Our approach performs unsupervised learning of behavior embeddings by incrementally self-organizing demonstrated behaviors. Task inference is made by finding the nearest behavior embedding to a demonstrated behavior, which is used together with the environment state as input to a multi-task policy trained with reinforcement learning to optimize performance over tasks. Unlike previous approaches, our approach makes no assumptions about task distribution and requires no task exploration to infer tasks. We evaluate our approach in experiments with concurrently and sequentially presented tasks and show that it outperforms other multi-task learning approaches in terms of generalization performance and convergence speed, particularly in the continual learning setting.

翻译：在机器人学习方面最近的进展使得机器人在掌握一套预先确定的任务方面变得越来越好。另一方面,作为人类,我们有能力在一生中学习越来越多的任务。连续的机器人学习是一个新兴的研究方向,目的是赋予机器人以这种能力。为了学习新的任务,机器人首先需要推断手头的任务。然而,任务推论在多任务学习文献中很少受到注意。在本文件中,我们提出一个不断学习机器人控制任务的新办法。我们的方法通过渐进式自我组织示范的行为,对行为嵌入进行不受监督的学习。任务推论是通过找到最接近的行为嵌入一种被证明的行为,与环境状态一起使用,作为经过强化学习的多任务政策的投入,以优化任务的业绩。与以往的做法不同,我们的方法对任务分配不作任何假设,不需要任务推导任务。我们的方法是对同时和顺序提出的任务的实验方法进行评估,并显示它超越了在一般工作中学习其他持续性能趋同的速度。

相关内容

Continuity

关注 4

让 iOS 8 和 OS X Yosemite 无缝切换的一个新特性。 > Apple products have always been designed to work together beautifully. But now they may really surprise you. With iOS 8 and OS X Yosemite, you’ll be able to do more wonderful things than ever before.

Source: Apple - iOS 8

【布朗大学David Abel博士论文】强化学习抽象理论，297页pdf

专知会员服务

89+阅读 · 2020年12月17日

【Google】平滑对抗训练，Smooth Adversarial Training

专知会员服务

49+阅读 · 2020年7月4日

【ICML2020】多视角对比图表示学习，Contrastive Multi-View GRL

专知会员服务

80+阅读 · 2020年6月11日

史上机器学习 &深度学习课程大合集，一站搞定，Deep Learning Drizzle

专知会员服务

176+阅读 · 2020年5月10日