In a depression-diagnosis-directed clinical session, doctors initiate a conversation with ample emotional support that guides the patients to expose their symptoms based on clinical diagnosis criteria. Such a dialog is a combination of task-oriented and chitchat, different from traditional single-purpose human-machine dialog systems. However, due to the social stigma associated with mental illness, the dialogue data related to depression consultation and diagnosis are rarely disclosed. Though automatic dialogue-based diagnosis foresees great application potential, data sparsity has become one of the major bottlenecks restricting research on such task-oriented chat dialogues. Based on clinical depression diagnostic criteria ICD-11 and DSM-5, we construct the D$^4$: a Chinese Dialogue Dataset for Depression-Diagnosis-Oriented Chat which simulates the dialogue between doctors and patients during the diagnosis of depression, including diagnosis results and symptom summary given by professional psychiatrists for each dialogue.Finally, we finetune on state-of-the-art pre-training models and respectively present our dataset baselines on four tasks including response generation, topic prediction, dialog summary, and severity classification of depressive episode and suicide risk. Multi-scale evaluation results demonstrate that a more empathy-driven and diagnostic-accurate consultation dialogue system trained on our dataset can be achieved compared to rule-based bots.
翻译:在抑郁诊断临床治疗过程中,医生在情感上提供充足支持,引导患者根据临床诊断标准披露其症状。这种对话是任务导向和切合的组合,不同于传统的单一目的的人体机器对话系统。然而,由于与精神疾病有关的社会耻辱,与抑郁症咨询和诊断有关的对话数据很少披露。虽然基于对话的自动诊断预见了巨大的应用潜力,但数据松散已成为限制这类任务导向性聊天研究的主要瓶颈之一。根据临床抑郁症诊断标准ICD-11和DSM-5,我们建造了4美元:中国抑郁症诊断诊断和自杀风险对话数据集,该数据集模拟了诊断抑郁症诊断期间医生和病人之间的对话,包括诊断结果和专业心理医生为每次对话提供的症状摘要。最后,我们微调了最先进的培训前模式,并分别介绍了我们关于四项任务的数据设置基线,包括应对措施的生成、主题预测、对话摘要和重度分类,以及压抑症和自杀风险对话。经过培训的多尺度评估结果可以显示,我们经过更深入的诊断性诊断性分析系统可实现的诊断性分析结果。