一组多发言者的原始和再版语音实时MRI视频和3D体积图象的多语音数据集 (A multispeaker dataset of raw and reconstructed speech production real-time MRI video and 3D volumetric images)

Yongwan Lim,Asterios Toutios,Yannick Bliesener,Ye Tian,Sajan Goud Lingala,Colin Vaz,Tanner Sorensen,Miran Oh,Sarah Harper,Weiyi Chen,Yoonjeong Lee,Johannes Töger,Mairym Lloréns Montesserin,Caitlin Smith,Bianca Godinez,Louis Goldstein,Dani Byrd,Krishna S. Nayak,Shrikanth S. Narayanan

from arxiv, 27 pages, 6 figures, 5 tables, submitted to Nature Scientific Data

Real-time magnetic resonance imaging (RT-MRI) of human speech production is enabling significant advances in speech science, linguistics, bio-inspired speech technology development, and clinical applications. Easy access to RT-MRI is however limited, and comprehensive datasets with broad access are needed to catalyze research across numerous domains. The imaging of the rapidly moving articulators and dynamic airway shaping during speech demands high spatio-temporal resolution and robust reconstruction methods. Further, while reconstructed images have been published, to-date there is no open dataset providing raw multi-coil RT-MRI data from an optimized speech production experimental setup. Such datasets could enable new and improved methods for dynamic image reconstruction, artifact correction, feature extraction, and direct extraction of linguistically-relevant biomarkers. The present dataset offers a unique corpus of 2D sagittal-view RT-MRI videos along with synchronized audio for 75 subjects performing linguistically motivated speech tasks, alongside the corresponding first-ever public domain raw RT-MRI data. The dataset also includes 3D volumetric vocal tract MRI during sustained speech sounds and high-resolution static anatomical T2-weighted upper airway MRI for each subject.

翻译：人类言语制作的实时磁共振成像(RT-MRI)使语音科学、语言学、生物刺激的语音技术发展和临床应用方面取得重大进展。然而,使用RT-MRI的便利有限,而且需要能够广泛利用的综合数据集来推动多个领域的研究。在语音制作过程中快速移动的动画和动态气道成像的成像要求高时分辨率和健全的重建方法。此外,虽然已经公布了重建的图像,但迄今为止还没有开放的数据集提供来自优化语音制作实验装置的原始多焦耳RT-MRI数据。这类数据集可以促成新的和改进的方法,用于动态图像重建、文物校正、地貌提取和直接提取与语言有关的生物标志。目前的数据集提供了一套独特的2D 斜视 RT-MRI视频和同步的音频,用于执行语言驱动性语音任务的75个主题,以及相应的首个公共域原始RT-MRI数据。这类数据集还包含用于每个分辨率高分辨率的3D-平流式磁共振动磁共振动磁共振成像。

相关内容

数据集

关注 88

数据集，又称为资料集、数据集合或资料集合，是一种由数据所组成的集合。
Data set（或dataset）是一个数据的集合，通常以表格形式出现。每一列代表一个特定变量。每一行都对应于某一成员的数据集的问题。它列出的价值观为每一个变量，如身高和体重的一个物体或价值的随机数。每个数值被称为数据资料。对应于行数，该数据集的数据可能包括一个或多个成员。

深度学习在医学影像智能处理中的应用与挑战

专知会员服务

83+阅读 · 2021年2月16日