In this paper we present our system for the Detection and Classification of Acoustic Scenes and Events (DCASE) 2020 Challenge Task 4: Sound event detection and separation in domestic environments. We introduce two new models: the forward-backward convolutional recurrent neural network (FBCRNN) and the tag-conditioned convolutional neural network (CNN). The FBCRNN employs two recurrent neural network (RNN) classifiers that share the same CNN for preprocessing. With one RNN processing a recording in the forward direction and the other in the backward direction, the two networks are trained to jointly predict audio tags, i.e., weak labels, at each time step within a recording, given that at each time step they have jointly processed the whole recording. This training encourages the classifiers to tag events as soon as possible. Therefore, after training, the networks can be applied to shorter audio segments of, e.g., 200 ms, which enables sound event detection (SED). Further, we propose a tag-conditioned CNN to complement SED. It is trained to predict strong labels while using (predicted) tags, i.e., weak labels, as additional input. For training, pseudo strong labels from an FBCRNN ensemble are used. The presented system placed fourth and third in the systems and teams rankings, respectively. Subsequent improvements allow our system to outperform the challenge baseline and winner systems on average by 18.0% and 2.2% event-based F1-score, respectively, on the validation set. Source code is publicly available at https://github.com/fgnt/pb_sed.
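The idea of applying forward and backward taggers to short segments around each frame can be illustrated with a minimal sketch. It is not the implementation from the pb_sed repository: the function names, the toy taggers (a simple maximum over per-frame activity values), and the rule of combining the two scores with a minimum are all assumptions made for this illustration only.

```python
from typing import Callable, List, Sequence, Tuple

Tagger = Callable[[Sequence[float]], float]


def toy_tagger(segment: Sequence[float]) -> float:
    """Hypothetical stand-in for a trained tagger: it reports the strongest
    activity seen anywhere in the segment it has processed."""
    return max(segment)


def forward_backward_scores(
    frames: Sequence[float],
    forward_tagger: Tagger,
    backward_tagger: Tagger,
    context: int,
) -> List[float]:
    """Score each frame t from two short segments that both include t.

    The forward tagger sees frames t-context..t in time order, the backward
    tagger sees frames t..t+context in reversed order. Taking the minimum of
    the two scores is an assumption of this sketch.
    """
    scores = []
    for t in range(len(frames)):
        past = frames[max(0, t - context): t + 1]
        future = frames[t: min(len(frames), t + context + 1)]
        fwd = forward_tagger(past)
        bwd = backward_tagger(list(reversed(future)))
        scores.append(min(fwd, bwd))
    return scores


def scores_to_events(
    scores: Sequence[float], threshold: float = 0.5
) -> List[Tuple[int, int]]:
    """Turn frame scores into (onset_frame, offset_frame) pairs, where the
    offset is the index of the first inactive frame after the event."""
    events = []
    onset = None
    for t, score in enumerate(scores):
        active = score >= threshold
        if active and onset is None:
            onset = t
        elif not active and onset is not None:
            events.append((onset, t))
            onset = None
    if onset is not None:
        events.append((onset, len(scores)))
    return events


# Toy per-frame activity of a single event class (illustrative data only).
activity = [0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0]
frame_scores = forward_backward_scores(activity, toy_tagger, toy_tagger, context=2)
detected = scores_to_events(frame_scores)
```

In this toy setting a frame only counts as active when both the segment ending at it and the segment starting at it contain the event, so the detected boundaries stay sharp even though each tagger looks at several frames of context. With real taggers, the context length would correspond to the segment duration, such as the 200 ms mentioned above.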