Tea chrysanthemum detection at its flowering stage is one of the key components for selective chrysanthemum harvesting robot development. However, it is a challenge to detect flowering chrysanthemums under unstructured field environments given the variations on illumination, occlusion and object scale. In this context, we propose a highly fused and lightweight deep learning architecture based on YOLO for tea chrysanthemum detection (TC-YOLO). First, in the backbone component and neck component, the method uses the Cross-Stage Partially Dense Network (CSPDenseNet) as the main network, and embeds custom feature fusion modules to guide the gradient flow. In the final head component, the method combines the recursive feature pyramid (RFP) multiscale fusion reflow structure and the Atrous Spatial Pyramid Pool (ASPP) module with cavity convolution to achieve the detection task. The resulting model was tested on 300 field images, showing that under the NVIDIA Tesla P100 GPU environment, if the inference speed is 47.23 FPS for each image (416 * 416), TC-YOLO can achieve the average precision (AP) of 92.49% on our own tea chrysanthemum dataset. In addition, this method (13.6M) can be deployed on a single mobile GPU, and it could be further developed as a perception system for a selective chrysanthemum harvesting robot in the future.
翻译:在开花阶段检测茶菊菊菊,是选择性切合菊花采集机器人开发的关键组成部分之一;然而,鉴于照明、隔热和物体规模的差异,在无结构的实地环境中检测开花的菊菊菊是一项挑战;在这方面,我们提议以YOLO为基础,在茶菊菊菊花检测(TC-YOLO)中建立一个高度结合和轻量的深学习结构。首先,在主干部分和颈部部分,该方法使用跨系统半晶菊花网络(CSPDenseNet)作为主网络,并嵌入自定义的聚合模块,以指导梯度流。在最后头部分,该方法将循环特性金字塔(RFP)多级熔化回流结构与Atroom Space Pyramid Pool (ASPP) 模块结合起来,该模块具有探测任务。该模型可在300个实地图像上测试,显示在NVDIA Tesla P100 GPUPE环境中,如果导导导速速度为47.23-TRAS系统,则可以将每个图像的精确度定位系统(47.MAS AS* ) 用于内部测测算。