With the global population aging rapidly, Alzheimer's disease (AD) is particularly prominent in older adults, which has an insidious onset and leads to a gradual, irreversible deterioration in cognitive domains (memory, communication, etc.). Speech-based AD detection opens up the possibility of widespread screening and timely disease intervention. Recent advances in pre-trained models motivate AD detection modeling to shift from low-level features to high-level representations. This paper presents several efficient methods to extract better AD-related cues from high-level acoustic and linguistic features. Based on these features, the paper also proposes a novel task-oriented approach by modeling the relationship between the participants' description and the cognitive task. Experiments are carried out on the ADReSS dataset in a binary classification setup, and models are evaluated on the unseen test set. Results and comparison with recent literature demonstrate the efficiency and superior performance of proposed acoustic, linguistic and task-oriented methods. The findings also show the importance of semantic and syntactic information, and feasibility of automation and generalization with the promising audio-only and task-oriented methods for the AD detection task.
翻译:随着全球人口迅速老龄化,老年阿尔茨海默症(AD)在老年人中特别突出,这种疾病具有阴险的发端,导致认知领域(模拟、通信等)的逐渐、不可逆转的恶化。 基于语音的自动检测开辟了广泛筛选和及时疾病干预的可能性。在经过培训的模型中,最近的进展鼓励了AD检测模型从低级别特征向高层次表现转变。本文件介绍了从高层声学和语言特征中更好地提取与AD相关的信号的几种有效方法。基于这些特点,本文件还提出了一种新的任务导向方法,为参与者描述和认知任务任务之间的关系建模。在二元分类设置中对ADRESS数据集进行了实验,并对无形测试集模型进行了评估。结果和与最新文献的比较表明,拟议的音响、语言和任务导向方法的效率和优异性表现。结果还表明,语义和合成信息的重要性,以及自动化和普及与有希望的音频和任务导向的方法对于AD检测任务的可行性。</s>