Agricultural image recognition tasks are becoming increasingly dependent on deep learning (DL); however, despite the excellent performance of DL, it is difficult to comprehend the type of logic or features of the input image it uses during decision making. Knowing the logic or features is highly crucial for result verification, algorithm improvement, training data improvement, and knowledge extraction. However, the explanations from the current heatmap-based algorithms are insufficient for the abovementioned requirements. To address this, this paper details the development of a classification and explanation method based on a variational autoencoder (VAE) architecture, which can visualize the variations of the most important features by visualizing the generated images that correspond to the variations of those features. Using the PlantVillage dataset, an acceptable level of explainability was achieved without sacrificing the classification accuracy. The proposed method can also be extended to other crops as well as other image classification tasks. Further, application systems using this method for disease identification tasks, such as the identification of potato blackleg disease, potato virus Y, and other image classification tasks, are currently being developed.
翻译:农业形象的识别任务越来越依赖于深层次的学习(DL);然而,尽管DL的出色表现,但很难理解它在决策中所使用的输入图像的逻辑或特征的类型。了解逻辑或特征对于结果核查、算法改进、培训数据改进和知识提取至关重要。然而,目前基于热映算法的解释对于上述要求来说是不够的。为此,本文件详细说明了基于变式自动编码(VAE)结构的分类和解释方法的开发情况,该结构可以通过直观地显示与这些特征的变异相对应的生成图像的变异性,从而直观地描绘出最重要的特征。利用Plant Village数据集,在不牺牲分类准确性的前提下实现了可接受的可解释性。拟议方法还可以推广到其他作物以及其他图像分类任务。此外,目前正在开发使用这种方法进行疾病识别任务的应用系统,例如查明土豆黑腿病、土豆病毒Y和其他图像分类任务。