Self-Rating Depression Scale (SDS) questionnaire has frequently been used for efficient depression preliminary screening. However, the uncontrollable self-administered measure can be easily affected by insouciantly or deceptively answering, and producing the different results with the clinician-administered Hamilton Depression Rating Scale (HDRS) and the final diagnosis. Clinically, facial expression (FE) and actions play a vital role in clinician-administered evaluation, while FE and action are underexplored for self-administered evaluations. In this work, we collect a novel dataset of 200 subjects to evidence the validity of self-rating questionnaires with their corresponding question-wise video recording. To automatically interpret depression from the SDS evaluation and the paired video, we propose an end-to-end hierarchical framework for the long-term variable-length video, which is also conditioned on the questionnaire results and the answering time. Specifically, we resort to a hierarchical model which utilizes a 3D CNN for local temporal pattern exploration and a redundancy-aware self-attention (RAS) scheme for question-wise global feature aggregation. Targeting for the redundant long-term FE video processing, our RAS is able to effectively exploit the correlations of each video clip within a question set to emphasize the discriminative information and eliminate the redundancy based on feature pair-wise affinity. Then, the question-wise video feature is concatenated with the questionnaire scores for final depression detection. Our thorough evaluations also show the validity of fusing SDS evaluation and its video recording, and the superiority of our framework to the conventional state-of-the-art temporal modeling methods.
翻译:自我乐观程度(SDS)问卷经常被用于高效的抑郁症初步筛查,然而,无法控制的自我管理措施很容易受到无意识或欺骗性回答的影响,并且通过临床医生管理的汉密尔顿抑郁症降压幅度(HDRS)和最终诊断得出不同的结果。临床上,面部表达和行动在诊所管理的评估中发挥着关键作用,而FE和行动在自我管理的评估中探索不足。在这项工作中,我们收集了200个主题的新数据集,以证明自我管理调查表及其相应的问答视频记录的有效性。为了自动解释SDS评估和配对视频的抑郁症,我们建议为长期的多长视频降压幅度(HDRS)和最终诊断生成一个端对端的等级框架,同时以问卷结果和回答时间为条件。我们采用一个等级模型3DCNN(FN)来进行时间模式探索,并用冗余自觉框架(RAS)来证明自我管理全球特征汇总的自定义框架(RAS)的有效性。为了从SDS评价和配对配视频的配对齐视频记录中自动解读,我们每次对等的升级的视频评估,也有效地利用了我们用于S-Slady Stal-SDERS的SSSSS的SS-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-C-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-