尾端至尾端音频打击回击:促进增强高效音频分类网络 (End-to-End Audio Strikes Back: Boosting Augmentations Towards An Efficient Audio Classification Network) - 专知论文

会员服务 ·

0

端到端 · state-of-the-art · Boosting（一种模型训练加速方式） · Networking · 讲稿 ·

2022 年 4 月 29 日

End-to-End Audio Strikes Back: Boosting Augmentations Towards An Efficient Audio Classification Network

翻译：尾端至尾端音频打击回击:促进增强高效音频分类网络

Avi Gazneli,Gadi Zimerman,Tal Ridnik,Gilad Sharir,Asaf Noy

While efficient architectures and a plethora of augmentations for end-to-end image classification tasks have been suggested and heavily investigated, state-of-the-art techniques for audio classifications still rely on numerous representations of the audio signal together with large architectures, fine-tuned from large datasets. By utilizing the inherited lightweight nature of audio and novel audio augmentations, we were able to present an efficient end-to-end network with strong generalization ability. Experiments on a variety of sound classification sets demonstrate the effectiveness and robustness of our approach, by achieving state-of-the-art results in various settings. Public code will be available.

翻译：虽然提出了高效的架构和大量用于端到端图像分类任务的扩增结构,并对此进行了大量调查,但最先进的音频分类技术仍然依靠大量音频信号和大型结构的表述,并参照大型数据集进行微调。通过利用音频和新音频扩增所遗留的轻量级性质,我们得以展示一个高效的端到端网络,具有很强的简单化能力。关于各种健全的分类组的实验表明我们的方法的有效性和稳健性,在各种环境中取得了最新的结果。公共代码将存在。

0

相关内容

端到端

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

开放量子系统非马尔科夫动力学过程量子仿真研究

国家自然科学基金

0+阅读 · 2014年12月31日

基于高维光晶格的超强模拟规范场研究

国家自然科学基金

0+阅读 · 2014年12月31日

疏水性离子液体电沉积镍铁合金微观结构调控及机理研究

国家自然科学基金

0+阅读 · 2013年12月31日

碳源胁迫对颗粒污泥稳定性及除磷特性的影响及机制

国家自然科学基金

0+阅读 · 2013年12月31日

利用离子阱颗粒质谱对微球表面吸附的定量表征

国家自然科学基金

0+阅读 · 2013年12月31日

中层大气—电离层（MAI）探测系统研究

国家自然科学基金

0+阅读 · 2012年12月31日

可生物降解固体碳源生物膜的形成机理及脱氮特性研究

国家自然科学基金

0+阅读 · 2011年12月31日

孔道结构稀土－有机配位聚合物的合成、荧光识别及吸附性能研究

国家自然科学基金

0+阅读 · 2011年12月31日

多复变函数空间上的复合算子与Toeplitz算子

国家自然科学基金

1+阅读 · 2009年12月31日

超支化聚合物/硅的多维、多尺度自组装研究

国家自然科学基金

0+阅读 · 2008年12月31日

iBoot: Image-bootstrapped Self-Supervised Video Representation Learning

iBoot: Image-bootstrapped Self-Supervised Video Representation Learning

Arxiv

0+阅读 · 2022年6月16日

AVATAR: Unconstrained Audiovisual Speech Recognition

Arxiv

0+阅读 · 2022年6月15日

Self-Supervised Implicit Attention: Guided Attention by The Model Itself

Arxiv

0+阅读 · 2022年6月15日

A Survey on Data Augmentation for Text Classification

A Survey on Data Augmentation for Text Classification

Arxiv

16+阅读 · 2021年7月7日

MVFNet: Multi-View Fusion Network for Efficient Video Recognition

Arxiv

13+阅读 · 2021年1月5日

Multi-Label Text Classification using Attention-based Graph Neural Network

Arxiv

46+阅读 · 2020年3月22日

A survey on Semi-, Self- and Unsupervised Techniques in Image Classification

A survey on Semi-, Self- and Unsupervised Techniques in Image Classification

Arxiv

100+阅读 · 2020年2月20日

Learning to Learn and Predict: A Meta-Learning Approach for Multi-Label Classification

Learning to Learn and Predict: A Meta-Learning Approach for Multi-Label Classification

Arxiv

17+阅读 · 2019年9月9日

Attention-based Ensemble for Deep Metric Learning

Arxiv

17+阅读 · 2018年4月2日

Deep Metric Learning with BIER: Boosting Independent Embeddings Robustly

Arxiv

18+阅读 · 2018年1月15日

VIP会员

文章信息

相关主题

state-of-the-art

Boosting（一种模型训练加速方式）

相关VIP内容

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【新书】《知识图谱与大语言模型的协同应用》，544页pdf

军事通信系统：安全行动的支柱

《缓解大语言模型（LLMs）幻觉：面向应用的检索增强生成（RAG）、推理与智能体系统综述》

【新书】机器学习系统，2620页pdf

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

相关论文

iBoot: Image-bootstrapped Self-Supervised Video Representation Learning

iBoot: Image-bootstrapped Self-Supervised Video Representation Learning

Arxiv

0+阅读 · 2022年6月16日

AVATAR: Unconstrained Audiovisual Speech Recognition

Arxiv

0+阅读 · 2022年6月15日

Self-Supervised Implicit Attention: Guided Attention by The Model Itself

Arxiv

0+阅读 · 2022年6月15日

A Survey on Data Augmentation for Text Classification

A Survey on Data Augmentation for Text Classification

Arxiv

16+阅读 · 2021年7月7日

MVFNet: Multi-View Fusion Network for Efficient Video Recognition

Arxiv

13+阅读 · 2021年1月5日

Multi-Label Text Classification using Attention-based Graph Neural Network

Arxiv

46+阅读 · 2020年3月22日

A survey on Semi-, Self- and Unsupervised Techniques in Image Classification

A survey on Semi-, Self- and Unsupervised Techniques in Image Classification

Arxiv

100+阅读 · 2020年2月20日

Learning to Learn and Predict: A Meta-Learning Approach for Multi-Label Classification

Learning to Learn and Predict: A Meta-Learning Approach for Multi-Label Classification

Arxiv

17+阅读 · 2019年9月9日

Attention-based Ensemble for Deep Metric Learning

Arxiv

17+阅读 · 2018年4月2日

Deep Metric Learning with BIER: Boosting Independent Embeddings Robustly

Arxiv

18+阅读 · 2018年1月15日

相关基金

开放量子系统非马尔科夫动力学过程量子仿真研究

国家自然科学基金

0+阅读 · 2014年12月31日

基于高维光晶格的超强模拟规范场研究

国家自然科学基金

0+阅读 · 2014年12月31日

疏水性离子液体电沉积镍铁合金微观结构调控及机理研究

国家自然科学基金

0+阅读 · 2013年12月31日

碳源胁迫对颗粒污泥稳定性及除磷特性的影响及机制

国家自然科学基金

0+阅读 · 2013年12月31日

利用离子阱颗粒质谱对微球表面吸附的定量表征

国家自然科学基金

0+阅读 · 2013年12月31日

中层大气—电离层（MAI）探测系统研究

国家自然科学基金

0+阅读 · 2012年12月31日

可生物降解固体碳源生物膜的形成机理及脱氮特性研究

国家自然科学基金

0+阅读 · 2011年12月31日

孔道结构稀土－有机配位聚合物的合成、荧光识别及吸附性能研究

国家自然科学基金

0+阅读 · 2011年12月31日

多复变函数空间上的复合算子与Toeplitz算子

国家自然科学基金

1+阅读 · 2009年12月31日

超支化聚合物/硅的多维、多尺度自组装研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员