Words are fundamental linguistic units that connect thoughts and things through meaning. However, words do not appear independently in a text sequence. The existence of syntactic rules induces correlations among neighboring words. Using an ordinal pattern approach, we present an analysis of lexical statistical connections for 11 major languages. We find that the diverse manners that languages utilize to express word relations give rise to unique pattern structural distributions. Furthermore, fluctuations of these pattern distributions for a given language can allow us to determine both the historical period when the text was written and its author. Taken together, our results emphasize the relevance of ordinal time series analysis in linguistic typology, historical linguistics and stylometry.
翻译:语言是基本语言单位,通过含义将思想和事物连接在一起。但是,文字在文字序列中并不独立出现。 综合规则的存在导致相邻语言之间的相互关系。 使用一个常规模式的方法,我们对11种主要语言的词汇统计联系进行了分析。 我们发现,语言用来表达文字关系的多种方式导致了独特的模式结构分布。 此外,特定语言的这些模式分布的波动使我们能够确定文字写作的历史时期和作者。 综合起来,我们的结果强调了语言类型、历史语言和tytytyllogy 中常规时间序列分析的相关性。</s>