Agricultural image recognition tasks are becoming increasingly dependent on deep learning (DL). Despite its excellent performance, it is difficult to comprehend what type of logic or features DL uses in its decision making. This has become a roadblock for the implementation and development of DL-based image recognition methods because knowing the logic or features used in decision making, such as in a classification task, is very important for verification, algorithm improvement, training data improvement, knowledge extraction, etc. To mitigate such problems, we developed a classification method based on a variational autoencoder architecture that can show not only the location of the most important features but also what variations of that particular feature are used. Using the PlantVillage dataset, we achieved an acceptable level of explainability without sacrificing the accuracy of the classification. Although the proposed method was tested for disease diagnosis in some crops, the method can be extended to other crops as well as other image classification tasks. In the future, we hope to use this explainable artificial intelligence algorithm in disease identification tasks, such as the identification of potato blackleg disease and potato virus Y (PVY), and other image classification tasks.
翻译:农业形象识别任务日益依赖深层学习(DL)。尽管表现出色,但很难理解DL在决策中使用哪种逻辑或特征。这已成为基于DL的图像识别方法的执行和发展的障碍,因为了解决策中所使用的逻辑或特征,例如分类任务,对于核查、算法改进、培训数据改进、知识提取等非常重要。为了减轻这些问题,我们开发了一种基于变式自动编码结构的分类方法,不仅能够显示最重要的特征的位置,而且能够显示该特征的变异性。我们利用植物植被数据集,在不牺牲分类准确性的情况下,实现了可接受的解释程度。虽然对一些作物的疾病诊断进行了测试,但该方法可以推广到其他作物以及其他图像分类任务。今后,我们希望在疾病识别任务中使用这种可以解释的人工智能算法,例如查明土豆黑腿病和马铃薯病毒Y(PVY)以及其他图像分类任务。