Due to the worldwide accessibility to the Internet along with the continuous advances in mobile technologies, physical and digital worlds have become completely blended, and the proliferation of social media platforms has taken a leading role over this evolution. In this paper, we undertake a thorough analysis towards better visualising and understanding the factors that characterise and differentiate social media users affected by mental disorders. We perform different experiments studying multiple dimensions of language, including vocabulary uniqueness, word usage, linguistic style, psychometric attributes, emotions' co-occurrence patterns, and online behavioural traits, including social engagement and posting trends. Our findings reveal significant differences on the use of function words, such as adverbs and verb tense, and topic-specific vocabulary, such as biological processes. As for emotional expression, we observe that affected users tend to share emotions more regularly than control individuals on average. Overall, the monthly posting variance of the affected groups is higher than the control groups. Moreover, we found evidence suggesting that language use on micro-blogging platforms is less distinguishable for users who have a mental disorder than other less restrictive platforms. In particular, we observe on Twitter less quantifiable differences between affected and control groups compared to Reddit.
翻译:由于全世界可以使用互联网,以及移动技术不断进步,物理和数字世界已经完全混合在一起,社交媒体平台的泛滥在这一演变中起到了主导作用。在本文件中,我们进行了透彻的分析,以便更好地了解和理解受精神障碍影响的社交媒体用户的特点和区别。我们进行了不同的实验,研究语言的多个层面,包括词汇独特性、文字使用、语言风格、心理特征、情感共发模式和在线行为特征,包括社会参与和张贴趋势。我们的调查结果显示,在使用诸如动词和动词紧张等功能词以及生物过程等专题词汇方面存在重大差异。关于情感表达,我们观察到,受影响用户往往比平均控制个人更经常地分享情绪。总体而言,受影响群体的每月张贴差异高于控制群体。此外,我们发现有证据表明,对于精神障碍的用户而言,微博平台上的语言使用比其他限制较少的平台更难辨别。我们特别在Twitter上观察到,受影响群体与控制群体之间的可量化差异比红化群体要小。