中语语音障碍的深入学习自动分类 (Deep Learning-based automated classification of Chinese Speech Sound Disorders)

This article describes a system for analyzing acoustic data to assist in the diagnosis and classification of children's speech sound disorders (SSDs) using a computer. The analysis concentrated on identifying and categorizing four distinct types of Chinese SSDs. The study collected and generated a speech corpus containing 2540 stopping, backing, final consonant deletion process (FCDP), and affrication samples from 90 children aged 3--6 years with normal or pathological articulatory features. Each recording was accompanied by a detailed diagnostic annotation by two speech-language pathologists (SLPs). Classification of the speech samples was accomplished using three well-established neural network models for image classification. The feature maps were created using three sets of Mel-frequency cepstral coefficients (MFCC) parameters extracted from speech sounds and aggregated into a three-dimensional data structure as model input. We employed six techniques for data augmentation to augment the available dataset while avoiding overfitting. The experiments examine the usability of four different categories of Chinese phrases and characters. Experiments with different data subsets demonstrate the system's ability to accurately detect the analyzed pronunciation disorders. The best multi-class classification using a single Chinese phrase achieves an accuracy of 74.4~percent.

翻译：本文介绍利用计算机分析声学数据以协助诊断和分类儿童言语声音障碍的系统; 分析集中于辨别和分类四种不同类型的中国SDSD, 收集并生成了包含2540个停止、支持、最后一致删除过程(FFDP)的语音资料,以及90个3-6岁儿童具有正常或病理动动脉特征的感应样本; 每记录都配有两种语言病理学家的详细诊断性说明; 语言样本的分类使用三种成熟的图像分类神经网络模型完成; 地貌图使用三套Mel-频率中位系数参数制作,从语音声音中提取并汇总成三维数据结构,作为模型输入; 我们使用了六种数据增强技术,以扩大现有数据集,同时避免过度校正; 实验用四种不同的中文词句和字符的可用性进行了详细诊断性说明; 不同数据组的实验表明系统能够准确检测分析的读音障碍。

相关内容

Automator

关注 0

Automator是苹果公司为他们的Mac OS X系统开发的一款软件。 只要通过点击拖拽鼠标等操作就可以将一系列动作组合成一个工作流，从而帮助你自动的（可重复的）完成一些复杂的工作。Automator还能横跨很多不同种类的程序，包括：查找器、Safari网络浏览器、iCal、地址簿或者其他的一些程序。它还能和一些第三方的程序一起工作，如微软的Office、Adobe公司的Photoshop或者Pixelmator等。

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日