A Streaming End-to-End Framework For Spoken Language Understanding - Zhuanzhi (专知) Papers


May 20, 2021

A Streaming End-to-End Framework For Spoken Language Understanding


Nihal Potdar,Anderson R. Avila,Chao Xing,Dong Wang,Yiran Cao,Xiao Chen

From arXiv; accepted at IJCAI 2021

End-to-end spoken language understanding (SLU) has recently attracted increasing interest. Compared to the conventional tandem-based approach that combines speech recognition and language understanding as separate modules, the new approach extracts users' intentions directly from the speech signals, resulting in joint optimization and low latency. Such an approach, however, is typically designed to process one intention at a time, which forces users to take multiple rounds to fulfill their requirements while interacting with a dialogue system. In this paper, we propose a streaming end-to-end framework that can process multiple intentions in an online and incremental way. The backbone of our framework is a unidirectional RNN trained with the connectionist temporal classification (CTC) criterion. By this design, an intention can be identified when sufficient evidence has been accumulated, and multiple intentions can be identified sequentially. We evaluate our solution on the Fluent Speech Commands (FSC) dataset, and the intent detection accuracy is about 97% on all multi-intent settings. This result is comparable to the performance of the state-of-the-art non-streaming models, but is achieved in an online and incremental way. We also apply our model to a keyword spotting task using the Google Speech Commands dataset, and the results are also highly promising.
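The online behaviour described in the abstract can be illustrated with a small sketch of greedy CTC decoding applied frame by frame. This is not the authors' implementation: the function name `stream_ctc_intents`, the blank index `0`, the intent label names, and the toy posterior values are all assumptions made for illustration. In a real system the per-frame posteriors would come from the unidirectional RNN; here they are hard-coded so the example runs with the Python standard library alone.

```python
from typing import Iterable, Iterator, Sequence, Tuple

BLANK = 0  # assumed index of the CTC blank label


def stream_ctc_intents(
    frame_posteriors: Iterable[Sequence[float]],
    labels: Sequence[str],
) -> Iterator[Tuple[int, str]]:
    """Greedy CTC decoding, one frame at a time.

    Yields (frame_index, intent) as soon as a new non-blank label
    becomes the most likely one, so earlier intents are available
    before the utterance has ended.
    """
    prev = BLANK
    for t, probs in enumerate(frame_posteriors):
        # Pick the most likely label for this frame.
        best = max(range(len(probs)), key=probs.__getitem__)
        # CTC collapse rule: drop blanks and merge consecutive repeats.
        if best != BLANK and best != prev:
            yield t, labels[best]
        prev = best


# Hypothetical label set and toy posteriors (rows are frames).
labels = ["<blank>", "activate_lights", "increase_volume"]
frames = [
    [0.90, 0.05, 0.05],  # blank
    [0.20, 0.70, 0.10],  # first intent appears
    [0.10, 0.80, 0.10],  # same intent repeated, merged
    [0.80, 0.10, 0.10],  # blank separates the two intents
    [0.10, 0.20, 0.70],  # second intent appears
]

detected = list(stream_ctc_intents(frames, labels))
print(detected)
```

Because the function is a generator that reads only the current frame and the previous label, it can consume frames as they arrive, which is the property that lets several intents be reported sequentially within one utterance.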

