TrimTail: 使用简单但有效的分光级长期徒刑的低寿命递解性ASR (TrimTail: Low-Latency Streaming ASR with Simple but Effective Spectrogram-Level Length Penalty) - 专知论文

会员服务 ·

0

流 · SimPLe · 语音识别 · 损失 · 模型评估 ·

2023 年 1 月 22 日

TrimTail: Low-Latency Streaming ASR with Simple but Effective Spectrogram-Level Length Penalty

翻译：TrimTail: 使用简单但有效的分光级长期徒刑的低寿命递解性ASR

Xingchen Song,Di Wu,Zhiyong Wu,Binbin Zhang,Yuekai Zhang,Zhendong Peng,Wenpeng Li,Fuping Pan,Changbao Zhu

from arxiv, submitted to ICASSP 2023

In this paper, we present TrimTail, a simple but effective emission regularization method to improve the latency of streaming ASR models. The core idea of TrimTail is to apply length penalty (i.e., by trimming trailing frames, see Fig. 1-(b)) directly on the spectrogram of input utterances, which does not require any alignment. We demonstrate that TrimTail is computationally cheap and can be applied online and optimized with any training loss or any model architecture on any dataset without any extra effort by applying it on various end-to-end streaming ASR networks either trained with CTC loss [1] or Transducer loss [2]. We achieve 100 $\sim$ 200ms latency reduction with equal or even better accuracy on both Aishell-1 and Librispeech. Moreover, by using TrimTail, we can achieve a 400ms algorithmic improvement of User Sensitive Delay (USD) with an accuracy loss of less than 0.2.

翻译：本文介绍TrimTail(TrimTail),这是一个简单而有效的排放规范化方法,用于改善流动 ASR 模型的延缓度。 TrimTail 的核心思想是直接对输入语句的光谱图进行长度处罚(即通过剪裁跟踪框架,见Fig.1-(b)),这不需要任何校正。我们证明TrimTail是计算便宜的,可以在网上应用,并且可以在任何数据集上以任何培训损失或任何模型结构进行优化,而无需付出任何额外的努力,在各种终端到终端流动 ASR 网络中应用该方法,或者在以 CCT 损失 [1 或 Transduker损失 [2] 培训的网络中应用该方法。我们在Aishell-1 和 Librispeech 上都实现了100 $simm $ 200 ms latency reduction,同时在 Aishell-1 和 Librispeech 上都实现400 mus logical squlational sabis gread salution of Unial relat (US)

0

相关内容

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

【因果基础】Causality Basics，36页ppt

专知会员服务

52+阅读 · 2021年8月8日

【快讯】ICML 2020论文出炉，1088篇上榜，你的paper中了吗？

【快讯】ICML 2020论文出炉，1088篇上榜，你的paper中了吗？

专知会员服务

52+阅读 · 2020年6月1日

【新书：机器学习简介】《A Concise Introduction to Machine Learning》by A.C. Faul (CRC 2019)

【新书：机器学习简介】《A Concise Introduction to Machine Learning》by A.C. Faul (CRC 2019)

专知会员服务

77+阅读 · 2020年2月8日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

局部学习的特征选择：Local-Learning-Based Feature Selection

局部学习的特征选择：Local-Learning-Based Feature Selection

我爱读PAMI

14+阅读 · 2019年9月20日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

15+阅读 · 2019年4月13日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【推荐】MXNet深度情感分析实战

【推荐】MXNet深度情感分析实战

机器学习研究会

16+阅读 · 2017年10月4日

由强剪切应力场引发的高接枝率聚烯烃接枝马来酸酐极性基团的熔融挤出研究

国家自然科学基金

0+阅读 · 2013年12月31日

火焰中多环芳烃（PAHs）的生成和演变机理

国家自然科学基金

0+阅读 · 2013年12月31日

基于HEVC的多视点视频加深度三维视频编码快速算法研究

国家自然科学基金

0+阅读 · 2013年12月31日

RBP-4对猪卵泡颗粒细胞增殖的影响及其机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

蝎毒增殖肽BmKpp促辐射后造血细胞增殖作用的分子靶标研究

国家自然科学基金

0+阅读 · 2011年12月31日

nanog对牙髓干细胞增殖分化的影响及信号通路调控

国家自然科学基金

0+阅读 · 2011年12月31日

燃煤火焰中间产物的在线监测与污染气体排放预测研究

国家自然科学基金

0+阅读 · 2009年12月31日

思茅藤C21甾体抗骨质疏松活性研究

国家自然科学基金

0+阅读 · 2009年12月31日

Toll 样受体介导的巨噬细胞对prion清除的分子机制

国家自然科学基金

0+阅读 · 2009年12月31日

激光合成波长纳米位移测量干涉仪的研制

国家自然科学基金

0+阅读 · 2008年12月31日

Robust online active learning

Arxiv

0+阅读 · 2023年3月15日

Machine Learning Changes the Rules for Flux Limiters

Arxiv

0+阅读 · 2023年3月14日

Input-length-shortening and text generation via attention values

Arxiv

0+阅读 · 2023年3月14日

Delivering Speaking Style in Low-resource Voice Conversion with Multi-factor Constraints

Arxiv

0+阅读 · 2023年3月14日

Mesh-SORT: Simple and effective location-wise tracker with lost management strategies

Arxiv

0+阅读 · 2023年3月12日

Correlation between upstreamness and downstreamness in random global value chains

Arxiv

0+阅读 · 2023年3月12日

Estimating a potential without the agony of the partition function

Arxiv

0+阅读 · 2023年3月11日

Attention-based Ensemble for Deep Metric Learning

Arxiv

17+阅读 · 2018年4月2日

End-to-End Multi-Task Learning with Attention

Arxiv

19+阅读 · 2018年3月28日

Deep Active Learning for Named Entity Recognition

Arxiv

15+阅读 · 2018年2月4日

VIP会员

文章信息

相关主题

相关VIP内容

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

【因果基础】Causality Basics，36页ppt

专知会员服务

52+阅读 · 2021年8月8日

【快讯】ICML 2020论文出炉，1088篇上榜，你的paper中了吗？

【快讯】ICML 2020论文出炉，1088篇上榜，你的paper中了吗？

专知会员服务

52+阅读 · 2020年6月1日

【新书：机器学习简介】《A Concise Introduction to Machine Learning》by A.C. Faul (CRC 2019)

【新书：机器学习简介】《A Concise Introduction to Machine Learning》by A.C. Faul (CRC 2019)

专知会员服务

77+阅读 · 2020年2月8日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

人工智能赋能自主武器与人类控制第三部分：人类控制与系统操作员 | 35页

人工智能赋能自主武器与人类控制第一部分：人类控制与机器学习的设计和开发 | 46页

军事指挥控制系统：2025年5种用途

人工智能赋能自主武器与人类控制第二部分：人类控制与军事指挥官 | 38页

相关资讯

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

局部学习的特征选择：Local-Learning-Based Feature Selection

局部学习的特征选择：Local-Learning-Based Feature Selection

我爱读PAMI

14+阅读 · 2019年9月20日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

15+阅读 · 2019年4月13日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【推荐】MXNet深度情感分析实战

【推荐】MXNet深度情感分析实战

机器学习研究会

16+阅读 · 2017年10月4日

相关论文

Robust online active learning

Arxiv

0+阅读 · 2023年3月15日

Machine Learning Changes the Rules for Flux Limiters

Arxiv

0+阅读 · 2023年3月14日

Input-length-shortening and text generation via attention values

Arxiv

0+阅读 · 2023年3月14日

Delivering Speaking Style in Low-resource Voice Conversion with Multi-factor Constraints

Arxiv

0+阅读 · 2023年3月14日

Mesh-SORT: Simple and effective location-wise tracker with lost management strategies

Arxiv

0+阅读 · 2023年3月12日

Correlation between upstreamness and downstreamness in random global value chains

Arxiv

0+阅读 · 2023年3月12日

Estimating a potential without the agony of the partition function

Arxiv

0+阅读 · 2023年3月11日

Attention-based Ensemble for Deep Metric Learning

Arxiv

17+阅读 · 2018年4月2日

End-to-End Multi-Task Learning with Attention

Arxiv

19+阅读 · 2018年3月28日

Deep Active Learning for Named Entity Recognition

Arxiv

15+阅读 · 2018年2月4日

相关基金

由强剪切应力场引发的高接枝率聚烯烃接枝马来酸酐极性基团的熔融挤出研究

国家自然科学基金

0+阅读 · 2013年12月31日

火焰中多环芳烃（PAHs）的生成和演变机理

国家自然科学基金

0+阅读 · 2013年12月31日

基于HEVC的多视点视频加深度三维视频编码快速算法研究

国家自然科学基金

0+阅读 · 2013年12月31日

RBP-4对猪卵泡颗粒细胞增殖的影响及其机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

蝎毒增殖肽BmKpp促辐射后造血细胞增殖作用的分子靶标研究

国家自然科学基金

0+阅读 · 2011年12月31日

nanog对牙髓干细胞增殖分化的影响及信号通路调控

国家自然科学基金

0+阅读 · 2011年12月31日

燃煤火焰中间产物的在线监测与污染气体排放预测研究

国家自然科学基金

0+阅读 · 2009年12月31日

思茅藤C21甾体抗骨质疏松活性研究

国家自然科学基金

0+阅读 · 2009年12月31日

Toll 样受体介导的巨噬细胞对prion清除的分子机制

国家自然科学基金

0+阅读 · 2009年12月31日

激光合成波长纳米位移测量干涉仪的研制

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员