The use of deep learning for electroencephalography (EEG) classification tasks has been rapidly growing in the last years, yet its application has been limited by the relatively small size of EEG datasets. Data augmentation, which consists in artificially increasing the size of the dataset during training, has been a key ingredient to obtain state-of-the-art performances across applications such as computer vision or speech. While a few augmentation transformations for EEG data have been proposed in the literature, their positive impact on performance across tasks remains elusive. In this work, we propose a unified and exhaustive analysis of the main existing EEG augmentations, which are compared in a common experimental setting. Our results highlight the best data augmentations to consider for sleep stage classification and motor imagery brain computer interfaces, showing predictive power improvements greater than 10% in some cases.
翻译:在过去几年里,对电子脑造影学(EEG)分类任务进行深层次学习的情况迅速增加,但其应用却由于EEG数据集规模较小而受到限制。数据扩增(包括培训期间人工增加数据集规模)是获得计算机视觉或语音等各种应用的最新性能的一个关键要素。虽然文献中提议对EEEG数据进行几处增强性能转换,但其对跨任务性能的积极影响仍然难以捉摸。在这项工作中,我们提议对现有的主要EEEG扩增进行统一和详尽的分析,在共同的实验环境中进行比较。我们的结果突出了用于睡眠阶段分类和运动图像脑计算机界面的最佳数据扩增,在某些情况下显示预测能力改善超过10%。