Dementia is a growing problem as our society ages, and detection methods are often invasive and expensive. Recent deep-learning techniques can offer a faster diagnosis and have shown promising results. However, they require large amounts of labelled data which is not easily available for the task of dementia detection. One effective solution to sparse data problems is data augmentation, though the exact methods need to be selected carefully. To date, there has been no empirical study of data augmentation on Alzheimer's disease (AD) datasets for NLP and speech processing. In this work, we investigate data augmentation techniques for the task of AD detection and perform an empirical evaluation of the different approaches on two kinds of models for both the text and audio domains. We use a transformer-based model for both domains, and SVM and Random Forest models for the text and audio domains, respectively. We generate additional samples using traditional as well as deep learning based methods and show that data augmentation improves performance for both the text- and audio-based models and that such results are comparable to state-of-the-art results on the popular ADReSS set, with carefully crafted architectures and features.
翻译:痴呆症是随着我们社会老化而日益加剧的一个问题,而检测方法往往具有侵入性和昂贵性。最近的深层学习技术可以提供更快的诊断,并显示出有希望的结果。然而,它们需要大量贴标签的数据,而这些数据对于痴呆症的检测工作来说不容易获得。数据稀少问题的一个有效解决办法是数据增加,尽管需要仔细选择确切的方法。到目前为止,还没有对用于NLP和语音处理的阿尔茨海默症(AD)数据集的扩大数据进行经验性研究。在这项工作中,我们调查用于自动检测的数据增加技术,并对两种文本和音频域模式的不同方法进行实证性评估。我们分别使用基于变压器的域模型以及用于文本和音频域的SVM和随机森林模型。我们利用传统和深层次的学习方法生成了更多的样本,并表明数据增加提高了基于文本和音频模型的性能,而这种结果与流行的ADRESS数据集的状态结果相似,并附有精心设计的架构和特征。