In his pioneering research, G. K. Zipf formulated a couple of statistical laws on the relationship between the frequency of a word with its number of meanings: the law of meaning distribution, relating the frequency of a word and its frequency rank, and the meaning-frequency law, relating the frequency of a word with its number of meanings. Although these laws were formulated more than half a century ago, they have been only investigated in a few languages. Here we present the first study of these laws in Catalan. We verify these laws in Catalan via the relationship among their exponents and that of the rank-frequency law. We present a new protocol for the analysis of these Zipfian laws that can be extended to other languages. We report the first evidence of two marked regimes for these laws in written language and speech, paralleling the two regimes in Zipf's rank-frequency law in large multi-author corpora discovered in early 2000s. Finally, the implications of these two regimes will be discussed.
翻译:G. K. Zipf在他的开拓性研究中,就一个字及其含义数的频率之间的关系制定了若干统计法:意义分配法,涉及一个词的频率及其频率等级,意义频率法,以及含义-频率法,涉及一个词的频率及其含义数。虽然这些法律是半个多世纪前制定的,但只用几种语言进行了调查。这里我们用加泰罗尼亚语介绍对这些法律的首次研究。我们通过加泰罗尼亚语的推举者和等级-频率法之间的关系来核查这些法律。我们为分析这些西普非语法律提出了一个新的议定书,可以推广到其他语言。我们用书面语言和语言报告这两个法律的明显制度的第一个证据,用大量多著作者公司在2000年代初发现的两种制度。最后,将讨论这两种制度的影响。