During a psychotherapy session, the counselor typically adopts techniques which are codified along specific dimensions (e.g., 'displays warmth and confidence', or 'attempts to set up collaboration') to facilitate the evaluation of the session. Those constructs, traditionally scored by trained human raters, reflect the complex nature of psychotherapy and highly depend on the context of the interaction. Recent advances in deep contextualized language models offer an avenue for accurate in-domain linguistic representations which can lead to robust recognition and scoring of such psychotherapy-relevant behavioral constructs, and support quality assurance and supervision. In this work, we propose a BERT-based model for automatic behavioral scoring of a specific type of psychotherapy, called Cognitive Behavioral Therapy (CBT), where prior work is limited to frequency-based language features and/or short text excerpts which do not capture the unique elements involved in a spontaneous long conversational interaction. The model focuses on the classification of therapy sessions with respect to the overall score achieved on the widely-used Cognitive Therapy Rating Scale (CTRS), but is trained in a multi-task manner in order to achieve higher interpretability. BERT-based representations are further augmented with available therapy metadata, providing relevant non-linguistic context and leading to consistent performance improvements. We train and evaluate our models on a set of 1,118 real-world therapy sessions, recorded and automatically transcribed. Our best model achieves an F1 score equal to 72.61% on the binary classification task of low vs. high total CTRS.
翻译:心理治疗课程期间,顾问通常采用根据具体层面(例如“展示温暖和信心”或“试图建立协作”)编纂的技术,以促进对会议的评价。这些建构传统上由训练有素的人类计分员评分,反映了心理治疗的复杂性,并高度取决于互动的背景。深背景化语言模型的最新进展为准确的日常语言表述提供了一个渠道,这可以导致对此类心理治疗相关行为结构的有力认识和评分,并支持质量保证和监督。在这项工作中,我们提出了基于BERT的自动行为评分模式,用于对特定类型的心理治疗进行自动行为评分,称为CBT。这些建模传统上由受过训练的人文评分,反映了心理治疗的复杂性质和(或)短文节选,这些模式并不反映自发长的谈话互动中涉及的独特内容。该模式侧重于对广泛使用的心理疗法相关行为结构的总体评分进行分类。我们用的是188BERT评级(CTRS),但以真实行为评分模式进行自动排序。我们用的是不断的理学评分方法,我们用的是不断更新的理算。