Real-time magnetic resonance imaging (RT-MRI) of human speech production is enabling significant advances in speech science, linguistics, bio-inspired speech technology development, and clinical applications. Easy access to RT-MRI is however limited, and comprehensive datasets with broad access are needed to catalyze research across numerous domains. The imaging of the rapidly moving articulators and dynamic airway shaping during speech demands high spatio-temporal resolution and robust reconstruction methods. Further, while reconstructed images have been published, to-date there is no open dataset providing raw multi-coil RT-MRI data from an optimized speech production experimental setup. Such datasets could enable new and improved methods for dynamic image reconstruction, artifact correction, feature extraction, and direct extraction of linguistically-relevant biomarkers. The present dataset offers a unique corpus of 2D sagittal-view RT-MRI videos along with synchronized audio for 75 subjects performing linguistically motivated speech tasks, alongside the corresponding first-ever public domain raw RT-MRI data. The dataset also includes 3D volumetric vocal tract MRI during sustained speech sounds and high-resolution static anatomical T2-weighted upper airway MRI for each subject.
翻译:人类言语制作的实时磁共振成像(RT-MRI)使语音科学、语言学、生物刺激的语音技术发展和临床应用方面取得重大进展。然而,使用RT-MRI的便利有限,而且需要能够广泛利用的综合数据集来推动多个领域的研究。在语音制作过程中快速移动的动画和动态气道成像的成像要求高时分辨率和健全的重建方法。此外,虽然已经公布了重建的图像,但迄今为止还没有开放的数据集提供来自优化语音制作实验装置的原始多焦耳RT-MRI数据。这类数据集可以促成新的和改进的方法,用于动态图像重建、文物校正、地貌提取和直接提取与语言有关的生物标志。目前的数据集提供了一套独特的2D 斜视 RT-MRI视频和同步的音频,用于执行语言驱动性语音任务的75个主题,以及相应的首个公共域原始RT-MRI数据。这类数据集还包含用于每个分辨率高分辨率的3D-平流式磁共振动磁共振动磁共振成像。