In this paper, we present a comprehensive analysis of Acoustic Scene Classification (ASC), the task of identifying the scene of an audio recording from its acoustic signature. In particular, we firstly propose an inception-based and low footprint ASC model, referred to as the ASC baseline. The proposed ASC baseline is then compared with benchmark and high-complexity network architectures of MobileNetV1, MobileNetV2, VGG16, VGG19, ResNet50V2, ResNet152V2, DenseNet121, DenseNet201, and Xception. Next, we improve the ASC baseline by proposing a novel deep neural network architecture which leverages residual-inception architectures and multiple kernels. Given the novel residual-inception (NRI) model, we further evaluate the trade off between the model complexity and the model accuracy performance. Finally, we evaluate whether sound events occurring in a sound scene recording can help to improve ASC accuracy, then indicate how a sound scene context is well presented by combining both sound scene and sound event information. We conduct extensive experiments on various ASC datasets, including Crowded Scenes, IEEE AASP Challenge on Detection and Classification of Acoustic Scenes and Events (DCASE) 2018 Task 1A and 1B, 2019 Task 1A and 1B, 2020 Task 1A, 2021 Task 1A, 2022 Task 1. The experimental results on several different ASC challenges highlight two main achievements; the first is to propose robust, general, and low complexity ASC systems which are suitable for real-life applications on a wide range of edge devices and mobiles; the second is to propose an effective visualization method for comprehensively presenting a sound scene context.
翻译:在本文中,我们展示了声学场景分类(ASC)的全面分析,这是从其声学签名中识别音频录音现场的任务。特别是,我们首先提出一个基于初始和低足迹的ASC模型,称为ASC基线。随后,我们将拟议的ASC基线与移动NetV1、移动NetV2、VGG16、VGG19、ResNet50V2、ResNet1525V2、DenseNet121、DenseNet201和Xcepion的基准进行比较。接着,我们改进了ASC基线,提出一个新的深层神经网络结构,利用残余感知应用架构和多个内核。鉴于新的残余感知(NRI)模型,我们进一步评估模型复杂性和模型准确性性能之间的交易。最后,我们评估在声音现场记录中发生的声音事件是否有助于提高ASC的准确性,然后通过将声学场景和声音事件范围都同时显示。我们在各种ASC-SEVA系统首层数据系统上进行了广泛的实验,包括Crow-Slievial A Galal A和IESCA 1号任务1 ASlistrational 和IEISCA和ISLT Fleval 2019任务中,A和ISlix ASal 20SLA和ISLT Fl 20SLA和I FT FSA和I FT FT A 20 20SI FT A SLT FT A 上提出一个任务中的一项实际性任务中的一项实际性任务中的一项成果提议。