In this paper, we demonstrate a unique recipe to enhance the effectiveness of audio machine learning approaches by fusing pre-processing techniques into a deep learning model. Our solution accelerates training and inference performance by optimizing hyper-parameters through training instead of costly random searches to build a reliable mosquito detector from audio signals. The experiments and the results presented here are part of the MOS C submission of the ACM 2022 challenge. Our results outperform the published baseline by 212% on the unpublished test set. We believe that this is one of the best real-world examples of building a robust bio-acoustic system that provides reliable mosquito detection in noisy conditions.
翻译:在本文中,我们展示了一种独特的方法,通过将预处理技术纳入深层学习模式来提高音频机器学习方法的有效性。我们的解决方案通过培训而不是用昂贵的随机搜索来从音频信号中建立可靠的蚊子探测器来优化超参数来加速培训和推断性能。这里介绍的实验和结果是MOS C提交的ACM 2022挑战的一部分。我们的结果比未公布的测试集上公布的基线高出了212 % 。 我们相信,这是建立强大的生物声学系统,在噪音条件下提供可靠的蚊子检测的最佳实例之一。