Real-time object detection on Unmanned Aerial Vehicles (UAVs) is a challenging issue due to the limited computing resources of edge GPU devices as Internet of Things (IoT) nodes. To solve this problem, in this paper, we propose a novel lightweight deep learning architectures named FasterX based on YOLOX model for real-time object detection on edge GPU. First, we design an effective and lightweight PixSF head to replace the original head of YOLOX to better detect small objects, which can be further embedded in the depthwise separable convolution (DS Conv) to achieve a lighter head. Then, a slimmer structure in the Neck layer termed as SlimFPN is developed to reduce parameters of the network, which is a trade-off between accuracy and speed. Furthermore, we embed attention module in the Head layer to improve the feature extraction effect of the prediction head. Meanwhile, we also improve the label assignment strategy and loss function to alleviate category imbalance and box optimization problems of the UAV dataset. Finally, auxiliary heads are presented for online distillation to improve the ability of position embedding and feature extraction in PixSF head. The performance of our lightweight models are validated experimentally on the NVIDIA Jetson NX and Jetson Nano GPU embedded platforms.Extensive experiments show that FasterX models achieve better trade-off between accuracy and latency on VisDrone2021 dataset compared to state-of-the-art models.
翻译:在无人驾驶飞行器(UAVs)上实时物体探测是一个具有挑战性的问题,因为边缘 GPU 设备作为Things(IoT) 互联网节点的计算资源有限。 为了解决这个问题,我们在本文件中提出一个基于 YOLOX 模型的新型轻量深学习结构,名为PeappleX, 用于在边缘GPU进行实时物体探测。 首先,我们设计一个有效和轻量的PixSF头取代YOLOX的原始头部,以更好地探测小物体,这些小物体可以进一步嵌入深度的分解(DS Conv)中,以达到一个较轻的首级。然后,在Neck层中开发一个称为SlimFPN的较细结构,以降低网络参数,这是精确和速度之间的权衡。此外,我们将关注模块嵌入到头层,以改进预测头部的特征提取效果。 同时,我们还改进标签分配战略和损失功能,以缓解UAVAVSD公司数据集的分类不平衡和框优化问题。 最后,将辅助头部用于在线蒸馏,以提高定位模型的内置能力,以便在SISISBSBSBSBSB 和SBSBSBSB的精度测试和SB的精度模型上,在SBSBSBSBSBSBSBSB的精度的精度上,从而的精度测试。