中语语音障碍的深入学习自动分类 (Deep Learning-based automated classification of Chinese Speech Sound Disorders)

This article describes a system for analyzing acoustic data in order to assist in the diagnosis and classification of children's speech disorders using a computer. The analysis concentrated on identifying and categorizing four distinct types of Chinese misconstructions. The study collected and generated a speech corpus containing 2540 Stopping, Velar, Consonant-vowel, and Affricate samples from 90 children aged 3-6 years with normal or pathological articulatory features. Each recording was accompanied by a detailed annotation from the field of speech therapy. Classification of the speech samples was accomplished using three well-established neural network models for image classification. The feature maps are created using three sets of MFCC parameters extracted from speech sounds and aggregated into a three-dimensional data structure as model input. We employ six techniques for data augmentation in order to augment the available dataset while avoiding over-simulation. The experiments examine the usability of four different categories of Chinese phrases and characters. Experiments with different data subsets demonstrate the system's ability to accurately detect the analyzed pronunciation disorders.

翻译：本篇文章介绍了一个分析声学数据的系统,以便用计算机协助对儿童言语障碍进行诊断和分类; 分析集中于辨别和分类四类中国错误建筑; 研究收集并生成了来自90名3-6岁儿童的具有正常或病理动脉特征的2540个止步器、Velar、Consonant-vowel和Affricate样本,其中含有正常或病理动脉动特征的2540个语音材料; 每项记录都附有语音治疗领域的详细说明; 语言样本的分类使用了三个成熟的神经网络模型,用于图像分类; 地貌地图是用三套MFCC参数绘制的,从语音声音中提取并汇总成三维数据结构作为模型输入的; 我们使用六种数据增强技术,以扩大现有数据集,同时避免过度模拟; 实验审查了四种不同类别的中国语系和字符的可用性; 与不同数据子组的实验表明系统能够准确检测分析的读音障碍。

相关内容

Automator

关注 5

Automator是苹果公司为他们的Mac OS X系统开发的一款软件。 只要通过点击拖拽鼠标等操作就可以将一系列动作组合成一个工作流，从而帮助你自动的（可重复的）完成一些复杂的工作。Automator还能横跨很多不同种类的程序，包括：查找器、Safari网络浏览器、iCal、地址簿或者其他的一些程序。它还能和一些第三方的程序一起工作，如微软的Office、Adobe公司的Photoshop或者Pixelmator等。

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日