With recent developments in Social Computing, Natural Language Processing and Clinical Psychology, the social NLP research community addresses the challenge of automation in mental illness on social media. A recent extension to the problem of multi-class classification of mental health issues is to identify the cause behind the user's intention. However, multi-class causal categorization for mental health issues on social media has a major challenge of wrong prediction due to the overlapping problem of causal explanations. There are two possible mitigation techniques to solve this problem: (i) Inconsistency among causal explanations/ inappropriate human-annotated inferences in the dataset, (ii) in-depth analysis of arguments and stances in self-reported text using discourse analysis. In this research work, we hypothesise that if there exists the inconsistency among F1 scores of different classes, there must be inconsistency among corresponding causal explanations as well. In this task, we fine tune the classifiers and find explanations for multi-class causal categorization of mental illness on social media with LIME and Integrated Gradient (IG) methods. We test our methods with CAMS dataset and validate with annotated interpretations. A key contribution of this research work is to find the reason behind inconsistency in accuracy of multi-class causal categorization. The effectiveness of our methods is evident with the results obtained having category-wise average scores of $81.29 \%$ and $0.906$ using cosine similarity and word mover's distance, respectively.
翻译:随着社会计算、自然语言处理和临床心理学的最新发展,社会国家语言方案研究界应对社会媒体上精神疾病自动化的挑战。最近,心理健康问题多级分类问题的一个延伸是查明用户意图背后的原因。然而,社会媒体对心理健康问题的多级因果分类由于因果关系解释问题重叠,面临错误预测的重大挑战。有两种可能的缓解技术可以解决这个问题:(一) 数据集中因果解释不一/不适当的人为附加说明的推论不一致;(二) 利用谈话分析深入分析自我报告文本中的论据和立场。在这一研究工作中,我们假设,如果不同类别F1之间有不一致的原因,那么在相应的因果解释之间也必然存在不一致。在这项任务中,我们精细调整分类人员,寻找关于社会媒体与LIME和综合重力(IG)方法的多级因果分类的解释。我们用CAM数据集测试我们的方法,并用注释性的解释校验。我们研究的一个关键贡献是,如果不同类别F1的分数,那么,我们平均的等级分析结果的等级分析结果就会明显地在8美元之后。