As environments involving both robots and humans become increasingly common, so does the need to account for people during planning. To plan effectively, robots must be able to respond to and sometimes influence what humans do. This requires a human model which predicts future human actions. A simple model may assume the human will continue what they did previously; a more complex one might predict that the human will act optimally, disregarding the robot; whereas an even more complex one might capture the robot's ability to influence the human. These models make different trade-offs between computational time and performance of the resulting robot plan. Using only one model of the human either wastes computational resources or is unable to handle critical situations. In this work, we give the robot access to a suite of human models and enable it to assess the performance-computation trade-off online. By estimating how an alternate model could improve human prediction and how that may translate to performance gain, the robot can dynamically switch human models whenever the additional computation is justified. Our experiments in a driving simulator showcase how the robot can achieve performance comparable to always using the best human model, but with greatly reduced computation.
翻译:随着涉及机器人和人类的环境越来越普遍,因此在规划期间需要为人负责。为了有效规划,机器人必须能够应对,有时影响人类的行为。这需要人类模型来预测未来人类的行为。一个简单的模型可以假设人类将继续像以前那样行事;一个更复杂的模型可以预测,人类将最佳地采取行动,无视机器人;而更为复杂的模型可以捕捉机器人影响人类的能力。这些模型在计算时间和所产生机器人计划的性能之间作出不同的权衡。只使用人类的模型,要么浪费计算资源,要么无法处理危急情况。在这项工作中,我们让机器人有机会使用一套人类模型,使其能够在线评估性能-计算交易。通过估计替代模型如何改进人类的预测,以及这如何转化为业绩收益,机器人可以在有理由进行额外计算时以动态方式转换人类模型。我们在驱动模拟器中的实验显示机器人的性能如何与始终使用最佳的人类模型相比,但计算却大大减少。