Our goal is to develop a sound event localization and detection (SELD) system that works robustly in unknown environments. A SELD system trained on known environment data is degraded in an unknown environment due to environmental effects such as reverberation and noise not contained in the training data. Previous studies on related tasks have shown that domain adaptation methods are effective when data on the environment in which the system will be used is available even without labels. However adaptation to unknown environments remains a difficult task. In this study, we propose echo-aware feature refinement (EAR) for SELD, which suppresses environmental effects at the feature level by using additional spatial cues of the unknown environment obtained through measuring acoustic echoes. FOA-MEIR, an impulse response dataset containing over 100 environments, was recorded to validate the proposed method. Experiments on FOA-MEIR show that the EAR effectively improves SELD performance in unknown environments.
翻译:我们的目标是开发一个健全的事件定位和探测(SELD)系统,该系统在未知环境中运作良好,对已知环境数据进行了培训,该系统在未知环境中退化,原因是环境影响,如反应和培训数据中未包含的噪音等,在未知环境中发生退化。以前关于相关任务的研究显示,如果即使没有标签,也能获得关于使用该系统的环境的数据,则域适应方法是有效的。然而,适应未知环境仍然是一项困难的任务。在本研究中,我们提议对SELD进行回声觉特征改进(EAR),该系统利用测量声回声获得的未知环境空间信号,抑制地貌层面的环境影响。FOA-MEIR是包含100多个环境的脉冲反应数据集,记录了用于验证拟议方法的功能。FOA-MEIR实验显示,EAR在未知环境中有效地改善了SELD的性能。