语音识别:不同深度学习方法的综述，Speech Recognition: a review of the different deep learning approaches

人类最好是通过使用同一种语言的语音进行交流。语音识别可以被定义为理解说话人所说的话的能力。

自动语音识别(ASR)是指识别人类语音并将其翻译成文本的任务。在过去的几十年里，这一研究领域得到了广泛的关注。它是人机通信的一个重要研究领域。早期的方法集中于人工特征提取和传统的技术，如高斯混合模型(GMM)、动态时间翘曲(DTW)算法和隐马尔可夫模型(HMM)。

近年来，神经网络如循环神经网络(RNN)、卷积神经网络(CNN)以及最近几年的《Transformers》已经应用于ASR，并取得了良好的性能。

成为VIP会员查看完整内容

相关内容

语音识别

关注 753

语音识别是计算机科学和计算语言学的一个跨学科子领域，它发展了一些方法和技术，使计算机可以将口语识别和翻译成文本。它也被称为自动语音识别（ASR），计算机语音识别或语音转文本（STT）。它整合了计算机科学，语言学和计算机工程领域的知识和研究。

【TPAMI2022】深度步态识别研究进展，Deep Gait Recognition: A Survey

专知会员服务

28+阅读 · 2022年3月1日

首篇「课程学习（Curriculum Learning)」2021综述论文

专知会员服务

50+阅读 · 2021年1月31日

注意力机制综述

专知会员服务

83+阅读 · 2021年1月26日

最新《3D医疗图像处理》综述论文，23页pdf，3D Deep Learning on Medical Images: A Review

专知会员服务

60+阅读 · 2020年7月14日

最新《贝叶斯深度学习》综述论文，35页pdf，A Survey on Bayesian Deep Learning

专知会员服务

209+阅读 · 2020年7月5日

深度强化学习方法及其在经济学中的应用综述，Comprehensive Review of Deep Reinforcement Learning Methods and Applicationsin Economic

专知会员服务

52+阅读 · 2020年4月7日

【自监督学习深度神经网络视觉特征学习综述论文】Self-supervised Visual Feature Learning with Deep Neural Networks: A Survey

专知会员服务

87+阅读 · 2020年3月1日

【综述】图像去噪的深度学习:综述，36页pdf，Deep Learning on Image Denoising: An overview

专知会员服务

71+阅读 · 2019年12月31日

【剑桥大学】神经机器翻译综述论文，Neural Machine Translation: A Review，附88页pdf

专知会员服务

37+阅读 · 2019年12月4日

【医学图像分割| 2019新综述】生物医学图像分割的机器学习技术：技术方面综述和最新应用介绍（Machine Learning Techniques for Biomedical Image Segmentation: An Overview of Technical Aspects and Introduction to State-of-Art Applications），附35页PDF

专知会员服务

57+阅读 · 2019年11月23日

「图神经网络前沿进展与应用」最新2022综述

专知

19+阅读 · 2022年1月24日

再介绍一篇最新的Contrastive Self-supervised Learning综述论文

夕小瑶的卖萌屋

2+阅读 · 2021年9月22日

最新最全《深度元学习》2021综述论文，68页pdf，A Survey of Deep Meta-Learning

专知

11+阅读 · 2021年4月23日

《文本分类大综述：从浅层到深度学习》最新2020版35页pdf

专知

59+阅读 · 2020年8月6日

【干货】AutoML自动机器学习：最新进展综述

专知

27+阅读 · 2019年8月9日

中文对比英文自然语言处理NLP的区别综述

AINLP

18+阅读 · 2019年3月20日

基于深度学习的NLP 32页最新进展综述，190篇参考文献

专知

19+阅读 · 2018年12月4日

【AIDL专栏】陶建华：深度神经网络与语音（附PPT）

人工智能前沿讲习班

12+阅读 · 2018年7月6日

深度学习综述（下载PDF版）

机器学习算法与Python学习

28+阅读 · 2018年7月3日

语音识别之--韩语语音识别

微信AI

16+阅读 · 2017年8月2日

基于人脸表情、身体姿态和语音的多模态情感识别方法研究

国家自然科学基金

10+阅读 · 2015年12月31日

基于深度神经网络的噪声鲁棒性语音识别方法研究

国家自然科学基金

4+阅读 · 2013年12月31日

基于弱指导机器学习技术的中文领域本体非分类关系自动学习研究

国家自然科学基金

0+阅读 · 2013年12月31日

自适应多分辨率宽带频谱压缩感知

国家自然科学基金

0+阅读 · 2012年12月31日

深度学习理论及在图像识别中的应用研究

国家自然科学基金

6+阅读 · 2012年12月31日

语音识别中的稀疏性深度学习

国家自然科学基金

11+阅读 · 2012年12月31日

基于分段条件随机场的连续语音识别技术

国家自然科学基金

1+阅读 · 2011年12月31日

《软件学报》学术期刊

国家自然科学基金

6+阅读 · 2011年12月31日

非特定人自然语音情感识别的建模方法研究

国家自然科学基金

1+阅读 · 2011年12月31日

人工语音带宽扩展新方法研究

国家自然科学基金

0+阅读 · 2011年12月31日

Audio-Visual Speech Enhancement Using Multimodal Deep Convolutional Neural Networks

Arxiv

0+阅读 · 2022年4月18日

Learning-Based Approaches for Graph Problems: A Survey

Arxiv

1+阅读 · 2022年4月17日

A Wholistic View of Continual Learning with Deep Neural Networks: Forgotten Lessons and the Bridge to Active and Open World Learning

Arxiv

35+阅读 · 2020年9月3日

Deep Learning on Image Denoising: An overview

Arxiv

13+阅读 · 2020年8月3日

A Simple Framework for Contrastive Learning of Visual Representations

Arxiv

21+阅读 · 2020年2月13日

Optimization for deep learning: theory and algorithms

Arxiv

106+阅读 · 2019年12月19日

A Comprehensive Survey on Transfer Learning

Arxiv

121+阅读 · 2019年11月7日

Deep Reinforcement Learning: An Overview

Arxiv

15+阅读 · 2018年6月23日

Label-aware Double Transfer Learning for Cross-Specialty Medical Named Entity Recognition

Arxiv

10+阅读 · 2018年4月28日

DeepSeek: Content Based Image Search & Retrieval

Arxiv

13+阅读 · 2018年1月11日

VIP会员