以专题性变革型基于专题的多模式抑郁症探测模型 (A Topic-Attentive Transformer-based Model For Multimodal Depression Detection)

Depression is one of the most common mental disorders, which imposes heavy negative impacts on one's daily life. Diagnosing depression based on the interview is usually in the form of questions and answers. In this process, the audio signals and their text transcripts of a subject are correlated to depression cues and easily recorded. Therefore, it is feasible to build an Automatic Depression Detection (ADD) model based on the data of these modalities in practice. However, there are two major challenges that should be addressed for constructing an effective ADD model. The first challenge is the organization of the textual and audio data, which can be of various contents and lengths for different subjects. The second challenge is the lack of training samples due to the privacy concern. Targeting to these two challenges, we propose the TOpic ATtentive transformer-based ADD model, abbreviated as TOAT. To address the first challenge, in the TOAT model, topic is taken as the basic unit of the textual and audio data according to the question-answer form in a typical interviewing process. Based on that, a topic attention module is designed to learn the importance of of each topic, which helps the model better retrieve the depressed samples. To solve the issue of data scarcity, we introduce large pre-trained models, and the fine-tuning strategy is adopted based on the small-scale ADD training data. We also design a two-branch architecture with a late-fusion strategy for building the TOAT model, in which the textual and audio data are encoded independently. We evaluate our model on the multimodal DAIC-WOZ dataset specifically designed for the ADD task. Experimental results show the superiority of our method. More importantly, the ablation studies demonstrate the effectiveness of the key elements in the TOAT model.

翻译：抑郁是一种最常见的精神失常,对一个人的日常生活造成严重的负面影响。根据访谈诊断抑郁通常采取问答的形式。在这个过程中,一个主题的音频信号及其文字记录与抑郁信号相关,并容易记录。因此,根据这些方式的数据建立自动抑郁症检测模式是可行的。然而,在构建一个有效的ADD模型方面,需要解决两大挑战。第一个挑战是组织文本和音频数据,这些数据可以是不同科目的不同内容和长度。第二个挑战是缺乏培训样本,因为隐私问题。针对这两个挑战,我们建议了基于TOAT的TOat 模型,建立一个基于ADD模型的自动抑郁症检测模型。在典型的面试过程中,将主题视为文本和音频数据的基本单元。根据这个模型,设计了一个主题关注模块,以学习每个主题的精度样本的重要性。针对这两个主题,我们提出了基于ATDDDD的高级变压器模型模型,我们用一个更精确的模型展示了一个模型的模型,我们用一个更精确的模型来演示一个模型的模型。我们用一个更精确的模型来模拟的模型,我们用一个更精确的模型来模拟的模型来显示一个模型的模型的模型。

相关内容

MoDELS

关注 43

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

自然语言处理顶会NAACL2022最佳论文出炉！

专知会员服务

43+阅读 · 2022年6月30日

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

165+阅读 · 2020年3月18日

抢鲜看！13篇CVPR2020论文链接/开源代码/解读

专知会员服务

50+阅读 · 2020年2月26日