We consider the problem of completing a set of $n$ tasks with a human-robot team using minimum effort. In many domains, teaching a robot to be fully autonomous can be counterproductive if there are finitely many tasks to be done. Rather, the optimal strategy is to weigh the cost of teaching a robot and its benefit -- how many new tasks it allows the robot to solve autonomously. We formulate this as a planning problem where the goal is to decide what tasks the robot should do autonomously (act), what tasks should be delegated to a human (delegate) and what tasks the robot should be taught (learn) so as to complete all the given tasks with minimum effort. This planning problem results in a search tree that grows exponentially with $n$ -- making standard graph search algorithms intractable. We address this by converting the problem into a mixed integer program that can be solved efficiently using off-the-shelf solvers with bounds on solution quality. To predict the benefit of learning, we propose a precondition prediction classifier. Given two tasks, this classifier predicts whether a skill trained on one will transfer to the other. Finally, we evaluate our approach on peg insertion and Lego stacking tasks, both in simulation and real-world, showing substantial savings in human effort.
翻译:我们考虑的是与人类机器人团队一起完成一组美元任务的问题。在许多领域,如果需要完成的任务有限,那么教机器人完全自主就会适得其反。相反,最佳战略是权衡教授机器人的费用及其好处 -- -- 机器人可以自主解决多少新任务。我们将此作为一个规划问题,目标是决定机器人应该自主完成哪些任务(行动),哪些任务应该委托给人类(远程),什么任务应该教给机器人(远程),以便尽可能少地完成所有给定的任务。这一规划问题导致一棵搜索树以美元迅速增长 -- -- 使标准图表搜索算法变得难以调和。我们通过将问题转化为混合的整齐化方案来解决,利用有解决方案质量界限的现成溶液来有效解决问题。为了预测学习的好处,我们建议一个先决条件的预测分类员。根据两项任务,这个分类员预测一个机器人所训练的技能是否会转移给另一个任务。最后,我们通过在真实的服务器上展示我们的方法,在真实的存储和图像中展示我们的方法。