When inferring the goals that others are trying to achieve, people intuitively understand that others might make mistakes along the way. This is crucial for activities such as teaching, offering assistance, and deciding between blame or forgiveness. However, Bayesian models of theory of mind have generally not accounted for these mistakes, instead modeling agents as mostly optimal in achieving their goals. As a result, they are unable to explain phenomena like locking oneself out of one's house, or losing a game of chess. Here, we extend the Bayesian Theory of Mind framework to model boundedly rational agents who may have mistaken goals, plans, and actions. We formalize this by modeling agents as probabilistic programs, where goals may be confused with semantically similar states, plans may be misguided due to resource-bounded planning, and actions may be unintended due to execution errors. We present experiments eliciting human goal inferences in two domains: (i) a gridworld puzzle with gems locked behind doors, and (ii) a block-stacking domain. Our model better explains human inferences than alternatives, while generalizing across domains. These findings indicate the importance of modeling others as bounded agents, in order to account for the full richness of human intuitive psychology.
翻译:当人们推断其他人试图实现的目标时,人们直觉地理解其他人可能会在前进过程中犯错误。这对于教学、提供援助、在指责或宽恕之间作出决定等活动至关重要。然而,贝叶斯思想理论模型通常没有考虑到这些错误,而是模拟代理人,在实现其目标方面最理想的模型。因此,他们无法解释将自己锁在家中之外或失去棋棋游戏等现象。在这里,我们扩展了巴耶斯思想理论框架,以模拟可能错误目标、计划和行动的有界限的理性代理人。我们通过模拟代理人将这种模型正式确定为概率性方案,其中目标可能与类似的国家混淆,计划可能因资源限制的规划而误入歧途,而行动可能因执行错误而无意。我们提出实验,在两个领域得出人类目标的推论:(一) 锁在门后面的宝石的网格世界迷,以及(二) 块拆领域。我们的模型比替代模型更好地解释人类的推理,同时将目标与完全的心理定位在跨区域进行概括。这些实验显示人类的模型的重要性。这些结果显示,在对其它领域进行彻底的推理的重要性。