ERANNs: Efficient Residual Audio Neural Networks for Audio Pattern Recognition

We present a new convolutional neural network (CNN) architecture based on ResNet for audio pattern recognition tasks. The main modification is a new hyper-parameter, which we call "the decreasing temporal size parameter", that reduces the temporal size of tensors by increasing stride sizes. Optimal values of this parameter reduce the number of multi-adds, which makes the system faster. This approach not only decreases computational complexity but can also preserve and even improve (for the AudioSet dataset) performance on audio pattern recognition tasks. Experiments on three datasets confirm this observation: AudioSet, ESC-50, and RAVDESS. Our best system achieves state-of-the-art performance on the AudioSet dataset with an mAP of 0.450. We also transfer a model pre-trained on AudioSet to ESC-50 and RAVDESS and obtain state-of-the-art results with accuracies of 0.961 and 0.748, respectively. We call our system "ERANN" (Efficient Residual Audio Neural Network).
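To make the link between temporal stride and multi-adds concrete, here is a minimal sketch in plain Python. It is not the ERANN architecture or the paper's exact parameterization: the function names, the two-layer toy stack, the channel counts, and the 1000 x 128 input size are all assumptions chosen for illustration. It uses only the standard convolution output-size formula and counts one multi-add per kernel weight per output position.

```python
def conv_out_len(length, kernel, stride, padding):
    """Output length along one axis of a convolution (standard formula)."""
    return (length + 2 * padding - kernel) // stride + 1


def conv2d_multi_adds(c_in, c_out, t_in, f_in, kernel=3,
                      stride_t=1, stride_f=1, padding=1):
    """Return (multi_adds, t_out, f_out) for one 2-D conv over a (time, freq) map.

    Hypothetical helper: counts c_in * c_out * kernel^2 multi-adds
    per output position and ignores biases.
    """
    t_out = conv_out_len(t_in, kernel, stride_t, padding)
    f_out = conv_out_len(f_in, kernel, stride_f, padding)
    multi_adds = c_in * c_out * kernel * kernel * t_out * f_out
    return multi_adds, t_out, f_out


def stack_multi_adds(t_in, f_in, first_stride_t):
    """Toy two-layer stack; only the first layer's temporal stride varies."""
    m1, t1, f1 = conv2d_multi_adds(1, 16, t_in, f_in, stride_t=first_stride_t)
    m2, t2, _ = conv2d_multi_adds(16, 32, t1, f1)
    return m1 + m2, t2


if __name__ == "__main__":
    # Assumed input: 1000 time frames x 128 frequency bins.
    for stride in (1, 2):
        total, t_final = stack_multi_adds(1000, 128, stride)
        print(f"temporal stride {stride}: {total} multi-adds, final T = {t_final}")
```

In this toy setting, raising the first layer's temporal stride from 1 to 2 halves the temporal size seen by every later layer, so the total multi-add count is halved as well. Whether accuracy is preserved at a given stride is an empirical question that the sketch does not address.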


