Automatic Severity Assessment of Dysarthric speech by using Self-supervised Model with Multi-task Learning


March 22, 2023


Eun Jung Yeo, Kwanghee Choi, Sunhee Kim, Minhwa Chung

From arXiv; accepted to ICASSP 2023

Automatic assessment of dysarthric speech is essential for sustained treatment and rehabilitation. However, obtaining atypical speech is challenging, which often leads to data scarcity. To tackle this problem, we propose a novel automatic severity assessment method for dysarthric speech that uses a self-supervised model in conjunction with multi-task learning. Wav2vec 2.0 XLS-R is jointly trained on two tasks: severity classification and auxiliary automatic speech recognition (ASR). For the baseline experiments, we employ hand-crafted acoustic features and machine learning classifiers such as SVM, MLP, and XGBoost. Evaluated on the Korean dysarthric speech QoLT database, our model outperforms the traditional baseline methods, with a relative increase of 1.25% in F1-score. In addition, the proposed model surpasses the model trained without the ASR head, achieving a 10.61% relative improvement. Furthermore, we show how multi-task learning affects severity classification performance by analyzing the latent representations and the regularization effect.
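The joint training described in the abstract can be pictured as a single objective that adds a weighted auxiliary ASR loss to the severity classification loss. The following is a minimal sketch under stated assumptions, not the authors' implementation: the function names, the weight `alpha`, the three-class logits, and the idea that the ASR loss arrives as a precomputed scalar (for example from a CTC-style head) are all hypothetical and are not taken from the abstract.

```python
import math


def cross_entropy(logits, target):
    """Softmax cross-entropy for one example, computed from raw class logits."""
    peak = max(logits)  # subtract the max before exponentiating, for stability
    log_norm = peak + math.log(sum(math.exp(x - peak) for x in logits))
    return log_norm - logits[target]


def multitask_loss(severity_logits, severity_label, asr_loss, alpha=0.1):
    """Severity classification loss plus a weighted auxiliary ASR loss.

    `alpha` is a hypothetical weighting hyperparameter, and `asr_loss` is
    assumed to be a scalar already computed by a separate ASR head.
    """
    cls_loss = cross_entropy(severity_logits, severity_label)
    return cls_loss + alpha * asr_loss


# Illustrative values only; these are not results from the paper.
logits = [2.0, 0.5, -1.0]  # hypothetical scores for three severity classes
total = multitask_loss(logits, severity_label=0, asr_loss=3.2, alpha=0.1)
print(f"combined loss: {total:.4f}")
```

Setting `alpha=0` in this sketch recovers a classifier trained without the ASR term, which corresponds to the kind of single-task comparison the abstract reports.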

