Sentiment and lexical analyses are widely used to detect depression or anxiety disorders. It has been documented that there are significant differences in the language used by a person with emotional disorders in comparison to a healthy individual. Still, the effectiveness of these lexical approaches could be improved further because the current analysis focuses on what the social media entries are about, and not how they are written. In this study, we focus on aspects in which these short texts are similar to each other, and how they were created. We present an innovative approach to the depression screening problem by applying Collgram analysis, which is a known effective method of obtaining linguistic information from texts. We compare these results with sentiment analysis based on the BERT architecture. Finally, we create a hybrid model achieving a diagnostic accuracy of 71%.
翻译:感官和词汇分析被广泛用于检测抑郁或焦虑障碍,据文献记载,与健康个人相比,患有情感障碍的人使用的语言差异很大,不过,这些词汇方法的有效性还可以进一步提高,因为目前的分析侧重于社交媒体条目的内容,而不是如何写成。在本研究中,我们侧重于这些短文彼此相似的方面,以及如何创建这些短文。我们通过应用Colgram分析对抑郁筛查问题提出创新办法,这是从文本中获得语言信息的一种已知的有效方法。我们将这些结果与基于BERT结构的情绪分析进行比较。最后,我们创建了一个混合模型,诊断准确度达到71%。