It is easier to hear birds than see them, however, they still play an essential role in nature and they are excellent indicators of deteriorating environmental quality and pollution. Recent advances in Machine Learning and Convolutional Neural Networks allow us to detect and classify bird sounds, by doing this, we can assist researchers in monitoring the status and trends of bird populations and biodiversity in ecosystems. We propose a sound detection and classification pipeline for analyzing complex soundscape recordings and identify birdcalls in the background. Our pipeline learns from weak labels, classifies fine-grained bird vocalizations in the wild, and is robust against background sounds (e.g., airplanes, rain, etc). Our solution achieved 10th place of 816 teams at the BirdCLEF 2021 Challenge hosted on Kaggle.
翻译:然而,听到鸟类比看到鸟类更容易,但它们在自然中仍然发挥着必不可少的作用,它们是环境质量和污染恶化的极好指标。机器学习和进化神经网络的最近进展使我们能够通过这样做,发现和分类鸟类声音,从而可以协助研究人员监测鸟类种群和生态系统生物多样性的状况和趋势。我们建议建立一个健全的检测和分类管道,用于分析复杂的声景记录和识别背景中的鸟类呼叫。我们的管道从薄弱标签中学习,对野生细小的鸟类声响进行分类,并且根据背景声音(如飞机、雨等)保持稳健。我们的解决方案在Kagle的BirdCLEF 2021挑战中达到了816个团队的第10位。