Solving real-world manipulation tasks requires robots to have a repertoire of skills applicable to a wide range of circumstances. When using learning-based methods to acquire such skills, the key challenge is to obtain training data that covers diverse and feasible variations of the task, which often requires non-trivial manual labor and domain knowledge. In this work, we introduce Active Task Randomization (ATR), an approach that learns robust skills through the unsupervised generation of training tasks. ATR selects suitable tasks, which consist of an initial environment state and manipulation goal, for learning robust skills by balancing the diversity and feasibility of the tasks. We propose to predict task diversity and feasibility by jointly learning a compact task representation. The selected tasks are then procedurally generated in simulation using graph-based parameterization. The active selection of these training tasks enables skill policies trained with our framework to robustly handle a diverse range of objects and arrangements at test time. We demonstrate that the learned skills can be composed by a task planner to solve unseen sequential manipulation problems based on visual inputs. Compared to baseline methods, ATR can achieve superior success rates in single-step and sequential manipulation tasks.
翻译:解决现实世界中的操作任务需要机器人具备适用于各种情况的技能库。当使用基于学习的方法来获得这些技能时,关键的挑战是获得涵盖任务的多样和可行变化的训练数据,这通常需要非平凡的手动劳动和领域知识。在这项工作中,我们介绍了主动任务随机化(ATR),一种通过无监督生成训练任务来学习强大技能的方法。ATR通过平衡任务的多样性和可行性来选择合适的任务,这些任务由初始环境状态和操作目标组成。我们建议通过共同学习紧凑的任务表示来预测任务的多样性和可行性。然后使用基于图形的参数化在模拟中程序生成所选任务。这些训练任务的主动选择使得使用我们的框架训练的技能策略能够在测试时强大地处理各种对象和布置。我们证明了学习的技能可以由任务规划器通过视觉输入解决看不见的顺序操作问题。与基线方法相比,ATR可以在单步和顺序操作任务中实现优越的成功率。