Deep learning has recently made remarkable progress in natural language processing. Yet, the resulting algorithms remain far from competing with the language abilities of the human brain. Predictive coding theory offers a potential explanation to this discrepancy: while deep language algorithms are optimized to predict adjacent words, the human brain would be tuned to make long-range and hierarchical predictions. To test this hypothesis, we analyze the fMRI brain signals of 304 subjects each listening to 70min of short stories. After confirming that the activations of deep language algorithms linearly map onto those of the brain, we show that enhancing these models with long-range forecast representations improves their brain-mapping. The results further reveal a hierarchy of predictions in the brain, whereby the fronto-parietal cortices forecast more abstract and more distant representations than the temporal cortices. Overall, this study strengthens predictive coding theory and suggests a critical role of long-range and hierarchical predictions in natural language processing.
翻译:最近,在自然语言处理方面,深层次的学习取得了显著的进步。然而,由此产生的算法仍然远远没有与人类大脑的语言能力相竞争。预测编码理论为这一差异提供了潜在的解释:虽然深语言算法被优化以预测相邻的单词,但人类大脑将被调整以作出远程和等级预测。为了检验这一假设,我们分析了304个对象的FMRI大脑信号,每个对象听70分钟短故事。在确认深语言算法的直线映射在大脑的直线映射后,我们发现,用长距离的预测显示来强化这些模型可以改善大脑的图象。结果进一步揭示了大脑中预测的等级,即前方-paricontical cotices 预测的抽象和远处的表达比时间曲线。总体来说,这项研究加强了预测编码理论,并提出了长距离和分级预测在自然语言处理中的关键性作用。