声音事件探测变异器:以事件为基础的最后到最后检测声音事件模型 (Sound Event Detection Transformer: An Event-based End-to-End Model for Sound Event Detection)

Sound event detection (SED) has gained increasing attention with its wide application in surveillance, video indexing, etc. Existing models in SED mainly generate frame-level predictions, converting it into a sequence multi-label classification problem, which inevitably brings a trade-off between event boundary detection and audio tagging when using weakly labeled data to train the model. Besides, it needs post-processing and cannot be trained in an end-to-end way. This paper firstly presents the 1D Detection Transformer (1D-DETR), inspired by Detection Transformer. Furthermore, given the characteristics of SED, the audio query and a one-to-many matching strategy for fine-tuning the model are added to 1D-DETR to form the model of Sound Event Detection Transformer (SEDT), which generates event-level predictions, end-to-end detection. Experiments are conducted on the URBAN-SED dataset and the DCASE2019 Task4 dataset, and both experiments have achieved competitive results compared with SOTA models. The application of SEDT on SED shows that it can be used as a framework for one-dimensional signal detection and may be extended to other similar tasks.

翻译：SED的现有模型主要产生框架级预测,将其转换成一个序列多标签分类问题,这不可避免地在使用贴有标签的薄弱数据来训练模型时使事件边界探测和音频标记之间产生权衡。此外,它需要后处理,无法接受端到端方式的培训。本文首先介绍了1D探测变异器(1D-DETR),这是由Setective变异器所启发的。此外,鉴于SEDD的特性,音频查询和微调模型的一至倍匹配战略被添加到1D-DETR,形成事件探测变异器(SEDT)的模型,产生事件级预测,端到端检测。对UBAN-SED数据集和DCASE2019任务4数据集进行了实验,这两项实验都取得了与SOTA模型相比的竞争结果。SED对SD的应用表明,它可以用作一维信号检测的框架,并可以推广到其他任务。

相关内容

MoDELS

关注 43

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【商汤科技】可变形Transformers端到端对象检测，Deformable DETR

专知会员服务

33+阅读 · 2020年10月11日

【视频描述综述论文】Video Description: A Survey of Methods, Datasets, and Evaluation Metrics

专知会员服务

65+阅读 · 2020年5月12日