With the availability of voice-enabled devices such as smart phones, mental health disorders could be detected and treated earlier, particularly post-pandemic. The current methods involve extracting features directly from audio signals. In this paper, two methods are used to enrich voice analysis for depression detection: graph transformation of voice signals, and natural language processing of the transcript based on representational learning, fused together to produce final class labels. The results of experiments with the DAIC-WOZ dataset suggest that integration of text-based voice classification and learning from low level and graph-based voice signal features can improve the detection of mental disorders like depression.
翻译:随着智能电话等语音辅助装置的可用性,可以更早地发现和治疗精神疾病,特别是大流行后的疾病。目前的方法包括直接从音频信号中提取特征。在本文中,使用了两种方法来丰富抑郁症检测语音分析内容:语音信号的图解转换,以及基于代表性学习的自然语言处理记录稿,并结合制作最后类标签。DAIC-WOZ数据集的实验结果表明,将基于文字的语音分类和从低水平和基于图表的语音信号特征中学习结合起来可以改善对抑郁症等精神疾病的检测。