Real-time vision-based system of fault detection (RVBS-FD) for freight trains is an essential part of ensuring railway transportation safety. Most existing vision-based methods still have high computational costs based on convolutional neural networks. The computational cost is mainly reflected in the backbone, neck, and post-processing, i.e., non-maximum suppression (NMS). In this paper, we propose a lightweight NMS-free framework to achieve real-time detection and high accuracy simultaneously. First, we use a lightweight backbone for feature extraction and design a fault detection pyramid to process features. This fault detection pyramid includes three novel individual modules using attention mechanism, bottleneck, and dilated convolution for feature enhancement and computation reduction. Instead of using NMS, we calculate different loss functions, including classification and location costs in the detection head, to further reduce computation. Experimental results show that our framework achieves over 83 frames per second speed with a smaller model size and higher accuracy than the state-of-the-art detectors. Meanwhile, the hardware resource requirements of our method are low during the training and testing process.
翻译:货运列车的基于实时视像的故障探测系统(RVBS-FD)是确保铁路运输安全的一个基本部分,大多数现有基于视像的方法仍然具有基于进化神经网络的高计算成本,计算成本主要反映在骨干、颈部和后处理,即非最大抑制(NMS)中。在本文中,我们提议了一个不使用轻量级NMS的框架,以便同时实现实时检测和高准确性。首先,我们使用一个轻量脊椎进行特征提取,并设计一个处理特征的故障检测金字塔。这种故障检测金字塔包括三个新的个体模块,使用注意机制、瓶颈和放大变速器来增强和减少特征。我们使用NMS计算不同的损失功能,包括检测头的分类和定位成本,以进一步降低计算。实验结果表明,我们的框架每秒达到83个以上的框架,其模型大小小于和精确度高于最先进的探测器。与此同时,我们方法的硬件资源需求在培训和测试过程中很低。