Sharing Low Rank Conformer Weights for Tiny Always-On Ambient Speech Recognition Models - 专知论文


Conformer · MoDELS · Weight · Speech recognition · Rank

March 15, 2023

Sharing Low Rank Conformer Weights for Tiny Always-On Ambient Speech Recognition Models


Steven M. Hernandez, Ding Zhao, Shaojin Ding, Antoine Bruguier, Rohit Prabhavalkar, Tara N. Sainath, Yanzhang He, Ian McGraw

From arXiv. Accepted to IEEE ICASSP 2023.

Continued improvements in machine learning techniques offer exciting new opportunities through the use of larger models and larger training datasets. However, there is a growing need to offer these new capabilities on-board low-powered devices such as smartphones, wearables and other embedded environments where only low memory is available. Towards this, we consider methods to reduce the model size of Conformer-based speech recognition models which typically require models with greater than 100M parameters down to just 5M parameters while minimizing impact on model quality. Such a model allows us to achieve always-on ambient speech recognition on edge devices with low-memory neural processors. We propose model weight reuse at different levels within our model architecture: (i) repeating full conformer block layers, (ii) sharing specific conformer modules across layers, (iii) sharing sub-components per conformer module, and (iv) sharing decomposed sub-component weights after low-rank decomposition. By sharing weights at different levels of our model, we can retain the full model in-memory while increasing the number of virtual transformations applied to the input. Through a series of ablation studies and evaluations, we find that with weight sharing and a low-rank architecture, we can achieve a WER of 2.84 and 2.94 for Librispeech dev-clean and test-clean respectively with a 5M parameter model.
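As a rough illustration of why combining low-rank decomposition with weight sharing shrinks a model, the sketch below factors one square projection into two thin matrices and reuses that single pair across several repeated applications. It is not the paper's architecture: the class name `SharedLowRankStack`, the residual `tanh` update, and the sizes (`dim=256`, `rank=32`, `repeats=8`) are all assumptions chosen only to make the parameter arithmetic concrete.

```python
import numpy as np


def dense_params(d_in, d_out):
    """Parameter count of one full-rank weight matrix (bias ignored)."""
    return d_in * d_out


def low_rank_params(d_in, d_out, rank):
    """Parameter count after factoring W (d_in x d_out) into
    U (d_in x rank) @ V (rank x d_out)."""
    return rank * (d_in + d_out)


class SharedLowRankStack:
    """Hypothetical toy stack: one low-rank pair (U, V) reused `repeats` times.

    Each repeat is a separate "virtual" transformation of the input,
    but no new parameters are allocated for it.
    """

    def __init__(self, dim, rank, repeats, seed=0):
        rng = np.random.default_rng(seed)
        self.u = rng.standard_normal((dim, rank)) / np.sqrt(dim)
        self.v = rng.standard_normal((rank, dim)) / np.sqrt(rank)
        self.repeats = repeats

    def num_params(self):
        return self.u.size + self.v.size

    def __call__(self, x):
        for _ in range(self.repeats):
            # Residual update using the same shared factors every time.
            x = x + np.tanh(x @ self.u @ self.v)
        return x


# Hypothetical sizes, chosen only for illustration.
dim, rank, repeats = 256, 32, 8
stack = SharedLowRankStack(dim, rank, repeats)

unshared_dense = repeats * dense_params(dim, dim)  # 8 independent dense layers
shared_low_rank = stack.num_params()               # one shared (U, V) pair

y = stack(np.ones((4, dim)))
print(unshared_dense, shared_low_rank, y.shape)
```

With these assumed sizes, eight independent dense layers would hold 8 × 256 × 256 = 524,288 weights, while the shared low-rank pair holds 32 × (256 + 256) = 16,384, a 32× reduction for the same number of applied transformations. How much quality is lost at a given rank and sharing level is an empirical question, which the paper addresses through its ablations.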

