Knowledge and skills can transfer from human teachers to human students. However, such direct transfer is often not scalable for physical tasks, as they require one-to-one interaction, and human teachers are not available in sufficient numbers. Machine learning enables robots to become experts and play the role of teachers to help in this situation. In this work, we formalize cooperative robot teaching as a Markov game, consisting of four key elements: the target task, the student model, the teacher model, and the interactive teaching-learning process. Under a moderate assumption, the Markov game reduces to a partially observable Markov decision process, with an efficient approximate solution. We illustrate our approach on two cooperative tasks, one in a simulated video game and one with a real robot.
翻译:然而,这种直接转让往往无法用于实际任务,因为它们需要一对一的互动,而且没有足够的人教师。机器学习使机器人能够成为专家,并在这种情形下发挥教师的作用。在这项工作中,我们把合作机器人教学正规化为马科夫游戏,由四个关键要素组成:目标任务、学生模式、教师模式和交互式教学学习过程。在适度假设下,马尔科夫游戏减少为部分可观测到的马尔科夫决策过程,并有一个高效的近似解决方案。我们举例说明了我们在两项合作任务上的做法,一项是模拟视频游戏,另一项是真正的机器人。