The onset of the COVID-19 pandemic has brought the mental health of people under risk. Social counselling has gained remarkable significance in this environment. Unlike general goal-oriented dialogues, a conversation between a patient and a therapist is considerably implicit, though the objective of the conversation is quite apparent. In such a case, understanding the intent of the patient is imperative in providing effective counselling in therapy sessions, and the same applies to a dialogue system as well. In this work, we take forward a small but an important step in the development of an automated dialogue system for mental-health counselling. We develop a novel dataset, named HOPE, to provide a platform for the dialogue-act classification in counselling conversations. We identify the requirement of such conversation and propose twelve domain-specific dialogue-act (DAC) labels. We collect 12.9K utterances from publicly-available counselling session videos on YouTube, extract their transcripts, clean, and annotate them with DAC labels. Further, we propose SPARTA, a transformer-based architecture with a novel speaker- and time-aware contextual learning for the dialogue-act classification. Our evaluation shows convincing performance over several baselines, achieving state-of-the-art on HOPE. We also supplement our experiments with extensive empirical and qualitative analyses of SPARTA.
翻译:在这种环境下,与一般的面向目标的对话不同,病人和治疗师之间的谈话是相当隐含的,尽管谈话的目的相当明显。在这种情况下,了解病人的意图对于在治疗过程中提供有效咨询至关重要,对对话系统也是如此。在这项工作中,我们在建立心理健康咨询自动对话系统方面迈出了一个小但重要的一步。我们开发了一个名为HOPE的新数据集,为咨询谈话的对话-行为分类提供一个平台。我们确定这种对话的要求,并提出12个具体领域对话-行为标签。我们从YouTube上公开提供的咨询会议视频中收集12.9K语句,提取他们的笔录,清洁,并用发援会标签注释他们。此外,我们提出SPARTA,一个基于变压器的架构,为对话活动分类提供新的演讲人和有时间意识的背景学习。我们的评价显示,在几个基线上取得了令人信服的业绩,还实现了对SPA-PA的定性分析。我们还进行了大量关于SAR-PE的定性分析。