Insects are a crucial part of our ecosystem. Sadly, in the past few decades, their numbers have worryingly decreased. In an attempt to gain a better understanding of this process and monitor the insects populations, Deep Learning may offer viable solutions. However, given the breadth of their taxonomy and the typical hurdles of fine grained analysis, such as high intraclass variability compared to low interclass variability, insect classification remains a challenging task. There are few benchmark datasets, which impedes rapid development of better AI models. The annotation of rare species training data, however, requires expert knowledge. Explainable Artificial Intelligence (XAI) could assist biologists in these annotation tasks, but choosing the optimal XAI method is difficult. Our contribution to these research challenges is threefold: 1) a dataset of thoroughly annotated images of wild bees sampled from the iNaturalist database, 2) a ResNet model trained on the wild bee dataset achieving classification scores comparable to similar state-of-the-art models trained on other fine-grained datasets and 3) an investigation of XAI methods to support biologists in annotation tasks.
翻译:可悲的是,在过去几十年中,昆虫的数量已经令人担忧地减少。为了更好地了解这一过程并监测昆虫种群,深学习可以提供可行的解决办法。然而,鉴于其分类学的广度以及细细粒分析的典型障碍,如与低阶级间变异相比,高阶级内部变异性以及低阶级间变异性,昆虫分类仍然是一项艰巨的任务。很少有基准数据集,这阻碍了更好的AI模型的快速发展。然而,稀有物种培训数据的注释需要专家知识。可解释的人工智能(XAI)可以帮助生物学家进行这些批注任务,但选择最佳的XAI方法是困难的。我们对这些研究挑战的贡献有三重:(1) 由从iNatalist数据库抽样的、 一组完全附加注释的野生生物图像组成的数据集,(2) 对野生蜂数据集进行了培训的ResNet模型,其分类分数可与其他精细的数据集相似,3) 对XAI方法进行调查,以支持生物学家进行注解任务。