Few-shot sound event detection is the task of detecting sound events, despite having only a few labelled examples of the class of interest. This framework is particularly useful in bioacoustics, where often there is a need to annotate very long recordings but the expert annotator time is limited. This paper presents an overview of the second edition of the few-shot bioacoustic sound event detection task included in the DCASE 2022 challenge. A detailed description of the task objectives, dataset, and baselines is presented, together with the main results obtained and characteristics of the submitted systems. This task received submissions from 15 different teams from which 13 scored higher than the baselines. The highest F-score was of 60% on the evaluation set, which leads to a huge improvement over last year's edition. Highly-performing methods made use of prototypical networks, transductive learning, and addressed the variable length of events from all target classes. Furthermore, by analysing results on each of the subsets we can identify the main difficulties that the systems face, and conclude that few-show bioacoustic sound event detection remains an open challenge.
翻译:微小的声学事件探测是探测声学事件的任务,尽管只贴上几个有关类别的例子。这个框架在生物学方面特别有用,因为常常需要说明非常长的录音,但专家说明的时间有限。本文概述了DCASE 2022 挑战中包含的微小生物声学事件探测任务的第二版。 详细介绍了任务目标、 数据集和基线, 以及所提交系统的主要结果和特点。 这项任务收到了15个不同小组的呈文, 其中13个得分高于基线。 最高F群在评估组上的比例为60%, 与去年的版本相比有很大改进。 高效方法利用了原型网络、 传输学习, 并解决了所有目标类别事件的不同长度。 此外, 通过分析每个子组的结果,我们可以查明系统面临的主要困难, 并得出结论, 少数生物声学事件探测仍是一个公开的挑战。