Surprisal theory has provided a unifying framework for understanding many phenomena in sentence processing (Hale, 2001; Levy, 2008a), positing that a word's conditional probability given all prior context fully determines processing difficulty. Problematically for this claim, one local statistic, word frequency, has also been shown to affect processing, even when conditional probability given context is held constant. Here, we ask whether other local statistics have a role in processing, or whether word frequency is a special case. We present the first clear evidence that more complex local statistics, word bigram and trigram probability, also affect processing independently of surprisal. These findings suggest a significant and independent role of local statistics in processing. Further, it motivates research into new generalizations of surprisal that can also explain why local statistical information should have an outsized effect.
翻译:超常理论为理解刑罚处理中的许多现象提供了一个统一框架(Hale,2001年;Levy,2008年a),认为一个单词的有条件概率在以往所有情况下都充分决定了处理困难。 当地的统计数字,一个字频率,也显示出对处理有影响,即使有条件概率在上下文中保持不变。在这里,我们问其它地方统计数字是否在处理中发挥作用,或单词频率是否是一个特殊案例。我们提出了第一个明确证据,即更复杂的本地统计数字,大字和三字概率,也影响处理,而独立于超常。这些结论表明本地统计数字在处理中具有重要和独立的作用。此外,它鼓励研究超常现象的新概括性,这也可以解释为什么本地统计资料应该产生超大效果。