Automated methods have been widely used to identify and analyze mental health conditions (e.g., depression) from various sources of information, including social media. Yet, deployment of such models in real-world healthcare applications faces challenges including poor out-of-domain generalization and lack of trust in black box models. In this work, we propose approaches for depression detection that are constrained to different degrees by the presence of symptoms described in PHQ9, a questionnaire used by clinicians in the depression screening process. In dataset-transfer experiments on three social media datasets, we find that grounding the model in PHQ9's symptoms substantially improves its ability to generalize to out-of-distribution data compared to a standard BERT-based approach. Furthermore, this approach can still perform competitively on in-domain data. These results and our qualitative analyses suggest that grounding model predictions in clinically-relevant symptoms can improve generalizability while producing a model that is easier to inspect.
翻译:各种信息来源(包括社交媒体)广泛使用自动化方法来识别和分析心理健康状况(例如抑郁症),包括社交媒体;然而,在现实世界的医疗保健应用中采用这种模式面临挑战,包括外向外的概括化不力和对黑盒模型缺乏信任等;在这项工作中,我们提出了受PHQ9所述症状不同程度制约的抑郁症检测方法;临床医生在抑郁症筛查过程中使用的问卷;在三个社交媒体数据集的数据集传输实验中,我们发现将模型以PHQ9的症状为基础,大大改善了其与标准的BERT方法相比,将分配外的数据普遍化的能力;此外,这一方法仍然可以在主数据上竞争。这些结果和我们的质量分析表明,将模型预测建立在临床相关症状上可以提高通用性,同时产生一种更容易检查的模式。