This report presents the dataset and baseline of Task 3 of the DCASE2021 Challenge on Sound Event Localization and Detection (SELD). The dataset is based on emulation of real recordings of static or moving sound events under real conditions of reverberation and ambient noise, using spatial room impulse responses captured in a variety of rooms and delivered in two spatial formats. The acoustical synthesis remains the same as in the previous iteration of the challenge, however the new dataset brings more challenging conditions of polyphony and overlapping instances of the same class. The most important difference of the new dataset is the introduction of directional interferers, meaning sound events that are localized in space but do not belong to the target classes to be detected and are not annotated. Since such interfering events are expected in every real-world scenario of SELD, the new dataset aims to promote systems that deal with this condition effectively. A modified SELDnet baseline employing the recent ACCDOA representation for SELD problems accompanies the dataset and is described herein. To investigate the individual and combined effects of ambient noise, interferers, and reverberation, we study the performance of the baseline on different versions of the dataset excluding or including combinations of these factors. The results indicate that by far the most detrimental effects are caused by directional interferers.
翻译:本报告介绍了DCASE2021 " 健康事件定位和探测挑战 " (SELD)任务3的数据集和基线。数据集以模拟真实记录在真实反响和环境噪音条件下静态或移动声音事件为基础,使用在各种房间捕捉的空间室脉冲反应反应,并以两种空间格式提供。声学合成与以前对挑战的迭代相同,然而,新的数据集带来了同一类多功能和重叠案例的更具挑战性的条件。新数据集的最重要区别是引入方向干扰器,即空间中已存在但不属于要检测的目标类别,且没有附加说明。由于在SELD的每一个现实世界情景中都预期会发生这种干扰事件,因此新数据集的目的是促进有效处理这一状况的系统。我们研究的是SELDnet的修改基准基线基准,使用最近ACDOA对SLD问题的代表性,与数据集相匹配,并在此描述。调查环境噪音、干扰器和反动因素的个别和组合影响,包括最有害的组合作用。我们研究的是,这些基线结果的演化方式,排除了这些影响。