Creating robots that can perform general-purpose service tasks in a human-populated environment has been a longstanding grand challenge for AI and Robotics research. One particularly valuable skill that is relevant to a wide variety of tasks is the ability to locate and retrieve objects upon request. This paper models this skill as a Scavenger Hunt (SH) game, which we formulate as a variation of the NP-hard stochastic traveling purchaser problem. In this problem, the goal is to find a set of objects as quickly as possible, given probability distributions of where they may be found. We investigate the performance of several solution algorithms for the SH problem, both in simulation and on a real mobile robot. We use Reinforcement Learning (RL) to train an agent to plan a minimal cost path, and show that the RL agent can outperform a range of heuristic algorithms, achieving near optimal performance. In order to stimulate research on this problem, we introduce a publicly available software stack and associated website that enable users to upload scavenger hunts which robots can download, perform, and learn from to continually improve their performance on future hunts.
翻译:创建能够在人类居住环境中执行通用服务任务的机器人,一直是AI和机器人研究面临的一项长期重大挑战。与多种任务相关的一项特别宝贵的技能是能够根据请求找到和检索对象。本文将这种技能作为Schavenger Hunt (SH) 游戏模型,我们把它作为NP-hard Stochestic旅行购买者问题的一种变异。在这个问题中,目标是尽快找到一套物体,考虑到它们可能找到的地点的概率分布。我们调查了SH问题的若干解决方案算法的性能,包括在模拟和真正的移动机器人上。我们使用加强学习(RL)来训练一个代理来规划一个最低成本路径,并显示RL代理可以超越一系列超自然算法,从而实现接近最佳的性能。为了刺激对这一问题的研究,我们引入了一个公开可用的软件堆放和相关的网站,使用户能够上可下载、操作和学习机器人在未来狩猎中如何不断改进的搜索功能。