Moving away from repetitive tasks, robots nowadays demand versatile skills that adapt to different situations. Task-parameterized learning improves the generalization of motion policies by encoding relevant contextual information in the task parameters, hence enabling flexible task executions. However, training such a policy often requires collecting multiple demonstrations in different situations. To comprehensively create different situations is non-trivial thus renders the method less applicable to real-world problems. Therefore, training with fewer demonstrations/situations is desirable. This paper presents a novel concept to augment the original training dataset with synthetic data for policy improvements, thus allows learning task-parameterized skills with few demonstrations.
翻译:如今,机器人摆脱了重复性任务,需要适应不同情况的多种技能。任务参数化学习通过在任务参数中将相关背景信息编码,从而能够灵活执行任务,改善了运动政策的普遍性。然而,培训这种政策往往需要收集不同情况下的多重演示。全面创造不同的情况并非三重性,因此,这种方法对现实世界问题的适用性较低。因此,培训示范/情况较少是可取的。本文件提出了一个新概念,用合成数据来补充最初的培训数据集,以便改进政策,从而使得学习任务参数化的技能能够以少量演示为基础。