AI can not only outperform people in many planning tasks, but also teach them how to plan better. All prior work was conducted in fully observable environments, but the real world is only partially observable. To bridge this gap, we developed the first metareasoning algorithm for discovering resource-rational strategies for human planning in partially observable environments. Moreover, we developed an intelligent tutor teaching the automatically discovered strategy by giving people feedback on how they plan in increasingly more difficult problems. We showed that our strategy discovery method is superior to the state-of-the-art and tested our intelligent tutor in a preregistered training experiment with 330 participants. The experiment showed that people's intuitive strategies for planning in partially observable environments are highly suboptimal, but can be substantially improved by training with our intelligent tutor. This suggests our human-centred tutoring approach can successfully boost human planning in complex, partially observable sequential decision problems.
翻译:AI不仅可以在许多规划任务中比人们表现更好,而且可以教他们如何进行更好的规划。以前的所有工作都是在完全可见的环境中进行的,但现实世界只是部分可见的。为了弥合这一差距,我们开发了第一个在部分可见环境中发现人类规划资源合理战略的元化算法。此外,我们开发了一个智能导师来教授自动发现的战略,向人们反馈他们如何在日益困难的问题中进行规划。我们显示,我们的战略发现方法优于最先进的方法,并在一个预先登记的训练实验中测试了我们的智能导师,有330名参与者参加。实验表明,在部分可见环境中进行规划的直观策略非常不理想,但可以通过与我们的智能导师一起培训大大改进。这表明,我们的以人为中心的辅导方法可以在复杂、部分可见的相继决策问题上成功地促进人类的规划。