Like visual processing, language processing is susceptible to illusions in which people systematically misperceive stimuli. In one such case--the comparative illusion (CI), e.g., More students have been to Russia than I have--comprehenders tend to judge the sentence as acceptable despite its underlying nonsensical comparison. Prior research has argued that this phenomenon can be explained as Bayesian inference over a noisy channel: the posterior probability of an interpretation of a sentence is proportional to both the prior probability of that interpretation and the likelihood of corruption into the observed (CI) sentence. Initial behavioral work has supported this claim by evaluating a narrow set of alternative interpretations of CI sentences and showing that comprehenders favor interpretations that are more likely to have been corrupted into the illusory sentence. In this study, we replicate and go substantially beyond this earlier work by directly predicting the strength of illusion with a quantitative model of the posterior probability of plausible interpretations, which we derive through a novel synthesis of statistical language models with human behavioral data. Our model explains not only the fine gradations in the strength of CI effects, but also a previously unexplained effect caused by pronominal vs. full noun phrase than-clause subjects. These findings support a noisy-channel theory of sentence comprehension by demonstrating that the theory makes novel predictions about the comparative illusion that bear out empirically. This outcome joins related evidence of noisy channel processing in both illusory and non-illusory contexts to support noisy channel inference as a unified computational-level theory of diverse language processing phenomena.
翻译:与视觉处理类似,语言处理也容易受到错觉的影响,即人们会系统性地误解刺激。在比较性错觉(CI)这一案例中(例如“More students have been to Russia than I have”),理解者倾向于认为这类句子可接受,尽管其隐含的比较关系毫无意义。先前研究认为,这一现象可通过噪声信道上的贝叶斯推断来解释:句子某一解释的后验概率,与该解释的先验概率以及其被噪声干扰成观测到的CI句子的似然度成正比。早期的行为研究通过评估CI句子的有限替代解释集,并表明理解者更倾向于那些更可能被干扰成错觉句子的解释,从而支持了这一主张。在本研究中,我们不仅复现了先前工作,还通过将统计语言模型与人类行为数据创新性结合,推导出对合理解释后验概率的定量模型,直接预测错觉强度,从而显著超越了先前研究。我们的模型不仅解释了CI效应的精细强度分级,还解释了先前未能解释的由代词性vs.完整名词短语作为than从句主语所引发的效应。这些发现通过证明该理论对比较性错觉做出了新颖且经实证验证的预测,支持了句子理解的噪声信道理论。这一结果与错觉和非错觉语境中噪声信道处理的相关证据相结合,共同支持噪声信道推断作为一种统一的计算层面理论,用以解释多样的语言处理现象。