老年阿尔茨海默氏病检测语言识别系统 (Conformer Based Elderly Speech Recognition System for Alzheimer's Disease Detection) - 专知论文

会员服务 ·

0

Conformer · 语音识别 · 隐藏单元 · Learning · state-of-the-art ·

2022 年 6 月 23 日

Conformer Based Elderly Speech Recognition System for Alzheimer's Disease Detection

翻译：老年阿尔茨海默氏病检测语言识别系统

Tianzi Wang,Jiajun Deng,Mengzhe Geng,Zi Ye,Shoukang Hu,Yi Wang,Mingyu Cui,Zengrui Jin,Xunying Liu,Helen Meng

from arxiv, 5 pages, 1 figure, accepted by INTERSPEECH 2022

Early diagnosis of Alzheimer's disease (AD) is crucial in facilitating preventive care to delay further progression. This paper presents the development of a state-of-the-art Conformer based speech recognition system built on the DementiaBank Pitt corpus for automatic AD detection. The baseline Conformer system trained with speed perturbation and SpecAugment based data augmentation is significantly improved by incorporating a set of purposefully designed modeling features, including neural architecture search based auto-configuration of domain-specific Conformer hyper-parameters in addition to parameter fine-tuning; fine-grained elderly speaker adaptation using learning hidden unit contributions (LHUC); and two-pass cross-system rescoring based combination with hybrid TDNN systems. An overall word error rate (WER) reduction of 13.6% absolute (34.8% relative) was obtained on the evaluation data of 48 elderly speakers. Using the final systems' recognition outputs to extract textual features, the best-published speech recognition based AD detection accuracy of 91.7% was obtained.

翻译：早期诊断阿尔茨海默氏病(AD)对于促进预防性护理以延缓进一步的进展至关重要。本文件介绍了在Dementia Bankk Pittamp 上开发一个基于最先进的基于Confer 的语音识别系统,用于自动自动自动检测。基准Confer 系统通过纳入一套专门设计的模型特征而得到显著改进,其中包括基于神经结构的自动配置,除参数微调外,还基于特定域的超强参数的自动配置;使用学习的隐藏单位贡献(LHUC)进行精密的老年人语音识别系统调整;与混合的TDNNN系统进行双空跨系统连接。根据48名老年人的评价数据,实现了13.6%的绝对(34.8%)的总体误差率(WER)下降。利用最后的系统识别输出来提取文字特征,获得了91.7%基于自动检测精度的最佳公开语音识别。

0

相关内容

Conformer

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

纽约大学最新《语音识别Speech Recognition》2020课程，不可错过！

纽约大学最新《语音识别Speech Recognition》2020课程，不可错过！

专知会员服务

44+阅读 · 2020年11月2日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

抢鲜看！13篇CVPR2020论文链接/开源代码/解读

抢鲜看！13篇CVPR2020论文链接/开源代码/解读

专知会员服务

50+阅读 · 2020年2月26日

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

TRPM7在神经细胞缺血损伤中的作用及DREAM对TRPM7调控机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

遗忘型轻度认知障碍患者内颞叶记忆网络动态变化研究

国家自然科学基金

0+阅读 · 2015年12月31日

ADIPOR1基因变异在2型糖尿病合并冠心病中的分子机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

基于线粒体蛋白质组学的针刺治疗阿尔茨海默病的分子机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于tau蛋白代谢通路基因多态性和多模态fMRI的遗忘型轻度认知障碍神经网络机制探讨

国家自然科学基金

0+阅读 · 2012年12月31日

木霉诱导下杨树ARF转录因子对其生长及抗病的分子调控机制

国家自然科学基金

0+阅读 · 2012年12月31日

老年性痴呆脑微环境对海马移植NSCs的作用及针刺干预研究

国家自然科学基金

0+阅读 · 2012年12月31日

利用遗传影像策略探寻遗忘型轻度认知损害神经影像特征

国家自然科学基金

0+阅读 · 2011年12月31日

丙戊酸钠对阿尔茨海默病的保护作用及机制研究

国家自然科学基金

0+阅读 · 2009年12月31日

整合素β#20449;号通路在非小细胞肺癌EGFR TKI耐药中的作用

国家自然科学基金

0+阅读 · 2008年12月31日

Uconv-Conformer: High Reduction of Input Sequence Length for End-to-End Speech Recognition

Arxiv

0+阅读 · 2022年8月16日

An Empirical Study of Pseudo-Labeling for Image-based 3D Object Detection

Arxiv

0+阅读 · 2022年8月15日

Subjective Evaluation of Deep Neural Network Based Speech Enhancement Systems in Real-World Conditions

Arxiv

0+阅读 · 2022年8月14日

Pre-training Tasks for User Intent Detection and Embedding Retrieval in E-commerce Search

Arxiv

0+阅读 · 2022年8月12日

Patient-Specific Game-Based Transfer Method for Parkinson's Disease Severity Prediction

Arxiv

0+阅读 · 2022年8月12日

EDTER: Edge Detection with Transformer

Arxiv

11+阅读 · 2022年3月16日

Generalized Out-of-Distribution Detection: A Survey

Generalized Out-of-Distribution Detection: A Survey

Arxiv

15+阅读 · 2021年10月21日

Pix2seq: A Language Modeling Framework for Object Detection

Arxiv

10+阅读 · 2021年9月22日

Text Detection and Recognition in the Wild: A Review

Arxiv

20+阅读 · 2020年6月8日

From Superpixel to Human Shape Modelling for Carried Object Detection

Arxiv

10+阅读 · 2018年1月10日

VIP会员

文章信息

相关主题

state-of-the-art

相关VIP内容

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

纽约大学最新《语音识别Speech Recognition》2020课程，不可错过！

纽约大学最新《语音识别Speech Recognition》2020课程，不可错过！

专知会员服务

44+阅读 · 2020年11月2日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

抢鲜看！13篇CVPR2020论文链接/开源代码/解读

抢鲜看！13篇CVPR2020论文链接/开源代码/解读

专知会员服务

50+阅读 · 2020年2月26日

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【ICCV2025教程】基础模型遇见具身智能体

军事机器学习设计：关于开发自动化任务摘要系统的梯次化设计科学研究 | 2025最新93页

扩散模型中的缓存方法综述：迈向高效的多模态生成

【ICCV2025教程】《迈向视觉语言模型的全面推理》

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

相关论文

Uconv-Conformer: High Reduction of Input Sequence Length for End-to-End Speech Recognition

Arxiv

0+阅读 · 2022年8月16日

An Empirical Study of Pseudo-Labeling for Image-based 3D Object Detection

Arxiv

0+阅读 · 2022年8月15日

Subjective Evaluation of Deep Neural Network Based Speech Enhancement Systems in Real-World Conditions

Arxiv

0+阅读 · 2022年8月14日

Pre-training Tasks for User Intent Detection and Embedding Retrieval in E-commerce Search

Arxiv

0+阅读 · 2022年8月12日

Patient-Specific Game-Based Transfer Method for Parkinson's Disease Severity Prediction

Arxiv

0+阅读 · 2022年8月12日

EDTER: Edge Detection with Transformer

Arxiv

11+阅读 · 2022年3月16日

Generalized Out-of-Distribution Detection: A Survey

Generalized Out-of-Distribution Detection: A Survey

Arxiv

15+阅读 · 2021年10月21日

Pix2seq: A Language Modeling Framework for Object Detection

Arxiv

10+阅读 · 2021年9月22日

Text Detection and Recognition in the Wild: A Review

Arxiv

20+阅读 · 2020年6月8日

From Superpixel to Human Shape Modelling for Carried Object Detection

Arxiv

10+阅读 · 2018年1月10日

相关基金

TRPM7在神经细胞缺血损伤中的作用及DREAM对TRPM7调控机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

遗忘型轻度认知障碍患者内颞叶记忆网络动态变化研究

国家自然科学基金

0+阅读 · 2015年12月31日

ADIPOR1基因变异在2型糖尿病合并冠心病中的分子机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

基于线粒体蛋白质组学的针刺治疗阿尔茨海默病的分子机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于tau蛋白代谢通路基因多态性和多模态fMRI的遗忘型轻度认知障碍神经网络机制探讨

国家自然科学基金

0+阅读 · 2012年12月31日

木霉诱导下杨树ARF转录因子对其生长及抗病的分子调控机制

国家自然科学基金

0+阅读 · 2012年12月31日

老年性痴呆脑微环境对海马移植NSCs的作用及针刺干预研究

国家自然科学基金

0+阅读 · 2012年12月31日

利用遗传影像策略探寻遗忘型轻度认知损害神经影像特征

国家自然科学基金

0+阅读 · 2011年12月31日

丙戊酸钠对阿尔茨海默病的保护作用及机制研究

国家自然科学基金

0+阅读 · 2009年12月31日

整合素β#20449;号通路在非小细胞肺癌EGFR TKI耐药中的作用

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员