Social media has become a valuable resource for studying suicidal ideation and assessing suicide risk. Among social media platforms, Reddit has emerged as the most promising because of its anonymity and its focus on topic-based communities (subreddits), such as r/SuicideWatch, r/Anxiety, and r/depression, that can indicate a user's state of mind or interest regarding mental health disorders. A challenge for previous work on suicide risk assessment has been the small amount of labeled data. We present an empirical investigation of several classes of weakly supervised approaches and show that pseudo-labeling based on related mental health issues (e.g., anxiety, depression) helps improve model performance for suicide risk assessment.