Language models have been shown to be very effective in predicting brain recordings of subjects experiencing complex language stimuli. For a deeper understanding of this alignment, it is important to understand the alignment between the detailed processing of linguistic information by the human brain versus language models. In NLP, linguistic probing tasks have revealed a hierarchy of information processing in neural language models that progresses from simple to complex with an increase in depth. On the other hand, in neuroscience, the strongest alignment with high-level language brain regions has consistently been observed in the middle layers. These findings leave an open question as to what linguistic information actually underlies the observed alignment between brains and language models. We investigate this question via a direct approach, in which we eliminate information related to specific linguistic properties in the language model representations and observe how this intervention affects the alignment with fMRI brain recordings obtained while participants listened to a story. We investigate a range of linguistic properties (surface, syntactic and semantic) and find that the elimination of each one results in a significant decrease in brain alignment across all layers of a language model. These findings provide direct evidence for the role of specific linguistic information in the alignment between brain and language models, and opens new avenues for mapping the joint information processing in both systems.
翻译:语言模型显示,语言模型在预测有复杂语言刺激因素的科目的大脑记录方面非常有效。为了更深入地理解这一一致性,必须理解人类大脑对语言信息的详细处理与语言模型之间的一致性。在语言模型中,语言测试任务揭示出神经语言模型信息处理的等级分级,这些模型从简单到复杂,随着深度的提高而发展。另一方面,在神经科学中,一直观察到与高语言大脑区域最紧密的一致在中层。这些发现留下了一个未决问题,即观察到的大脑和语言模型之间对齐实际上是什么语言信息。我们通过直接的方法来调查这一问题,在语言模型中消除与特定语言特性有关的信息,并观察这一干预如何影响与参与者聆听故事时获得的FMRI脑记录的一致性。我们调查了语言特性的范围(表层、合成和语义),发现消除每一种语言特性导致新语言模型各个层次的大脑对齐度显著下降。这些发现直接证明具体语言信息在脑模型和语言模型之间联合处理中的作用,打开了大脑和语言模型之间的联合处理渠道。