Humans can attribute mental states to others, a capacity known as Theory of Mind. However, it is unknown to what extent this ability results from an innate biological endowment or from experience accrued through child development, particularly exposure to language describing others' mental states. We test the viability of the language exposure hypothesis by assessing whether models exposed to large quantities of human language develop evidence of Theory of Mind. In a pre-registered analysis, we present a linguistic version of the False Belief Task, widely used to assess Theory of Mind, to both human participants and a state-of-the-art Large Language Model, GPT-3. Both are sensitive to others' beliefs, but the language model does not perform as well as the humans, nor does it explain the full extent of their behavior, despite being exposed to more language than a human would in a lifetime. This suggests that while language exposure may in part explain how humans develop Theory of Mind, other mechanisms are also responsible.
翻译:人类可以将精神状态归结于他人,一种称为“思想理论”的能力。然而,尚不清楚这种能力在多大程度上来自天生的生物天赋或通过儿童发育积累的经验,特别是接触描述他人精神状态的语言。我们通过评估接触大量人类语言的模型是否发展了精神理论的证据,检验语言暴露假设的可行性。在预先登记的分析中,我们向人类参与者和最先进的大语言模型GPT-3提供了广泛用于评估思想理论的语言版本。两者都对他人的信仰敏感,但语言模型的表现和人的表现都不尽人意,也没有解释其行为的全部程度,尽管在一生中接触的语言多于人的意愿。这说明语言暴露可能部分地解释人类如何发展思想理论,但其他机制也有责任。