It is well-known that, when sufficiently young children encounter a new word, they tend to attach it to a meaning that does not have a word yet in their lexicon. In previous research, the strategy was shown to be optimal from an information theoretic standpoint. However, the information theoretic model employed neither explains the weakening of that vocabulary learning bias in older children or polylinguals nor reproduces Zipf's meaning-frequency law, namely the non-linear relationship between the number of meanings of a word and its frequency. Here we consider a generalization of the model that is channeled to reproduce that law. The analysis of the new model reveals regions of the phase space where the bias disappears consistently with the weakening or loss of the bias in older children or polylinguals. In the deep learning era, the model is a transparent low-dimensional tool for future experimental research and illustrates the predictive power of a theoretical framework originally designed to shed light on the origins of Zipf's rank-frequency law.
翻译:众所周知,当足够年幼的儿童遇到一个新词时,他们往往会把它附加到在他们的词汇中还没有一个词的含义上。在以前的研究中,战略从信息理论的角度显示是最佳的。然而,所使用的信息理论模型既没有解释该词汇在年长儿童或多种语言中学习偏见的削弱,也没有解释Zipf的含义频率法,即一个词的含义数量与其频率之间的非线性关系。我们在这里考虑将用于复制该法律的模型加以概括。对新模型的分析揭示了阶段空间的各个区域,在这些区域,偏见随着年长儿童或多种语言偏见的削弱或丧失而不断消失。在深层次的学习时代,该模型是未来实验研究的一个透明的低维工具,并说明了最初旨在揭示Zipf等级-频率法起源的理论框架的预测力。