Accurate insect pest recognition is significant to protect the crop or take the early treatment on the infected yield, and it helps reduce the loss for the agriculture economy. Design an automatic pest recognition system is necessary because manual recognition is slow, time-consuming, and expensive. The Image-based pest classifier using the traditional computer vision method is not efficient due to the complexity. Insect pest classification is a difficult task because of various kinds, scales, shapes, complex backgrounds in the field, and high appearance similarity among insect species. With the rapid development of deep learning technology, the CNN-based method is the best way to develop a fast and accurate insect pest classifier. We present different convolutional neural network-based models in this work, including attention, feature pyramid, and fine-grained models. We evaluate our methods on two public datasets: the large-scale insect pest dataset, the IP102 benchmark dataset, and a smaller dataset, namely D0 in terms of the macro-average precision (MPre), the macro-average recall (MRec), the macro-average F1- score (MF1), the accuracy (Acc), and the geometric mean (GM). The experimental results show that combining these convolutional neural network-based models can better perform than the state-of-the-art methods on these two datasets. For instance, the highest accuracy we obtained on IP102 and D0 is $74.13\%$ and $99.78\%$, respectively, bypassing the corresponding state-of-the-art accuracy: $67.1\%$ (IP102) and $98.8\%$ (D0). We also publish our codes for contributing to the current research related to the insect pest classification problem.
翻译:准确的害虫昆虫识别对于保护作物或对受感染的产量进行早期处理非常重要,有助于减少农业经济的损失。设计自动的害虫识别系统是必要的,因为人工识别缓慢、耗时和昂贵。使用传统计算机视觉方法的图像化害虫分类器效率不高。昆虫分类是一项艰巨的任务,因为各种种类、规模、形状、实地背景复杂和昆虫物种之间外观相似。随着深入学习技术的迅速发展,基于CNN的绕行法是发展快速和准确的虫害分类器的最佳方法。我们展示了这项工作中以神经网络为基础的不同模型,包括关注度、特征金字塔和细细度模型。我们评估了两种公共数据集的方法:大型昆虫数据集、IP102基准数据集和小数据集,即宏观平均精确度(MPre)、宏观平均回顾(MREc)、宏观平均F1-102分类(MF1-10) 以及我们当前和正数的评分数(MF1),这些模型的精确度(Acc) 以及这些实验模型的精确度可以显示的准确性数据。