Today's large language models (LLMs) routinely generate coherent, grammatical and seemingly meaningful paragraphs of text. This achievement has led to speculation that these networks are -- or will soon become -- "thinking machines", capable of performing tasks that require abstract knowledge and reasoning. Here, we review the capabilities of LLMs by considering their performance on two different aspects of language use: 'formal linguistic competence', which includes knowledge of rules and patterns of a given language, and 'functional linguistic competence', a host of cognitive abilities required for language understanding and use in the real world. Drawing on evidence from cognitive neuroscience, we show that formal competence in humans relies on specialized language processing mechanisms, whereas functional competence recruits multiple extralinguistic capacities that comprise human thought, such as formal reasoning, world knowledge, situation modeling, and social cognition. In line with this distinction, LLMs show impressive (although imperfect) performance on tasks requiring formal linguistic competence, but fail on many tests requiring functional competence. Based on this evidence, we argue that (1) contemporary LLMs should be taken seriously as models of formal linguistic skills; (2) models that master real-life language use would need to incorporate or develop not only a core language module, but also multiple non-language-specific cognitive capacities required for modeling thought. Overall, a distinction between formal and functional linguistic competence helps clarify the discourse surrounding LLMs' potential and provides a path toward building models that understand and use language in human-like ways.
翻译:今天的大型语言模型(LLMS)通常生成连贯、语法和看似有意义的文字段落。这一成就导致人们猜测这些网络是 -- -- 或不久将成为 -- -- “思维机器”,能够执行需要抽象知识和推理的任务。在这里,我们审查LLMs的能力,考虑它们在语言使用的两个不同方面的表现:“正式语言能力”,其中包括对某种语言规则和模式的了解,以及“功能语言能力”,这是在现实世界中理解和使用语言所需的一系列认知能力。根据认知神经科学的证据,我们表明,人类的正式能力依赖于专门的语言处理机制,而功能能力则征聘了包括人类思想的多种语言外能力,例如正式推理、世界知识、情况建模和社会认知。根据这种区别,LMS在需要正式语言能力的任务上表现出了令人印象深刻(虽然不完善)的成绩,但在许多需要功能能力的测试中却未能达到。我们根据这一证据认为,(1)当代LMSMS应该被严肃地作为正式语言技能的模型;(2)在现实语言和语言模型中使用的多种语言路径上,不需要在正式语言模式中使用某种语言模式上进行区分。