The quality grading of mangoes is a crucial task for mango growers as it vastly affects their profit. However, until today, this process still relies on laborious efforts of humans, who are prone to fatigue and errors. To remedy this, the paper approaches the grading task with various convolutional neural networks (CNN), a tried-and-tested deep learning technology in computer vision. The models involved include Mask R-CNN (for background removal), the numerous past winners of the ImageNet challenge, namely AlexNet, VGGs, and ResNets; and, a family of self-defined convolutional autoencoder-classifiers (ConvAE-Clfs) inspired by the claimed benefit of multi-task learning in classification tasks. Transfer learning is also adopted in this work via utilizing the ImageNet pretrained weights. Besides elaborating on the preprocessing techniques, training details, and the resulting performance, we go one step further to provide explainable insights into the model's working with the help of saliency maps and principal component analysis (PCA). These insights provide a succinct, meaningful glimpse into the intricate deep learning black box, fostering trust, and can also be presented to humans in real-world use cases for reviewing the grading results.
翻译:芒果的质量定级对于芒果种植者来说是一项关键的任务,因为它极大地影响到他们的利润。然而,直到今天,这一进程仍然依赖于人类的勤奋努力,他们容易疲劳和错误。为了纠正这一点,纸张与各种进化神经网络(CNN)接触了分级任务,这是计算机视觉方面经过试验和测试的深层学习技术。模型包括Mask R-CNN(背景清除),图像网络挑战的众多过去赢家,即AlexNet、VGG和ResNets;以及自定义的自定义自成一体的自成一体的自成一体的自动计算机级化器(ConvAE-Clfs),其灵感来自分类任务中多任务学习的所谓好处。在这项工作中,还利用图像网络预先限制的重量进行转移学习。除了阐述预处理技术、培训细节和由此产生的表现外,我们还进一步提供可以解释的对模型工作的深入了解,借助突出的地图和主要组成部分分析(PCA)。这些深刻的洞察力为真实的深层学习黑盒提供了一个清晰的洞察力,并且可以审查。