In this study we developed an automated system that evaluates speech and language features from audio recordings of neuropsychological examinations of 92 subjects in the Framingham Heart Study. A total of 265 features were used in an elastic-net regularized binomial logistic regression model to classify the presence of cognitive impairment, and to select the most predictive features. We compared performance with a demographic model from 6,258 subjects in the greater study cohort (0.79 AUC), and found that a system that incorporated both audio and text features performed the best (0.92 AUC), with a True Positive Rate of 29% (at 0% False Positive Rate) and a good model fit (Hosmer-Lemeshow test > 0.05). We also found that decreasing pitch and jitter, shorter segments of speech, and responses phrased as questions were positively associated with cognitive impairment.
翻译:在这项研究中,我们开发了一个自动化系统,评价Framingham心脏研究92个科目的神经心理学检查录音中的言语和语言特征;在弹性网正规化的二进制后勤回归模型中共使用了265个特征,用于对认知缺陷的存在进行分类,并选择了最预测的特征;我们从较大研究组的6,258个科目(0.79 澳洲分校)中将性能与人口模型进行比较,发现一个包含音频和文字特征的系统表现得最佳(0.92澳洲分校),其真实正率为29%(为0%假正正率),而且模型适合(Hosmer-Lemeshow测试 > 0.05),我们还发现,音频和音速下降、语音短段和答复的表达方式与认知缺陷有着积极的联系。