Research on sound event detection (SED) in environmental settings has seen increased attention in recent years. The large amounts of (private) domestic or urban audio data needed raise significant logistical and privacy concerns. The inherently distributed nature of these tasks, make federated learning (FL) a promising approach to take advantage of largescale data while mitigating privacy issues. While FL has also seen increased attention recently, to the best of our knowledge there is no research towards FL for SED. To address this gap and foster further research in this field, we create and publish novel FL datasets for SED in domestic and urban environments. Furthermore, we provide baseline results on the datasets in a FL context for three deep neural network architectures. The results indicate that FL is a promising approach for SED, but faces challenges with divergent data distributions inherent to distributed client edge devices.
翻译:近年来,关于环境环境中健全事件探测(SED)的研究日益受到重视,大量(私人)国内或城市音频数据需要大量(私人)的国内或城市音频数据引起了重大的后勤和隐私关切,这些任务固有的分布性质,使联合学习(FL)成为利用大规模数据,同时减少隐私问题的有希望的方法,虽然FL最近也看到越来越多的人关注我们的知识,但没有研究SED的FL。为了弥补这一差距,促进这一领域的进一步研究,我们为国内和城市环境中的SED创建和出版新的FL数据集。此外,我们提供了三种深层神经网络结构在FL背景下数据集的基线结果。结果显示,FL是SED的一个有希望的方法,但面临着分布客户边缘设备所固有的不同数据分布的挑战。