TextDescriptives is a Python package for calculating a large variety of statistics from text. It is built on top of spaCy and can be easily integrated into existing workflows. The package has already been used for analysing the linguistic stability of clinical texts, creating features for predicting neuropsychiatric conditions, and analysing linguistic goals of primary school students. This paper describes the package and its features.