The automatic detection of major rail components using railway images is beneficial to ensure the rail transport safety. In this paper, we propose an attention-powered deep convolutional network (AttnConv-net) to detect multiple rail components including the rail, clips, and bolts. The proposed method consists of a deep convolutional neural network (DCNN) as the backbone, cascading attention blocks (CAB), and two feed forward networks (FFN). Two types of positional embedding are applied to enrich information in latent features extracted from the backbone. Based on processed latent features, the CAB aims to learn the local context of rail components including their categories and component boundaries. Final categories and bounding boxes are generated via two FFN implemented in parallel. To enhance the detection of small components, various data augmentation methods are employed in the training process. The effectiveness of the proposed AttnConv-net is validated with one real dataset and another synthesized dataset. Compared with classic convolutional neural network based methods, our proposed method simplifies the detection pipeline by eliminating the need of prior- and post-processing, which offers a new speed-quality solution to enable faster and more accurate image-based rail component detections
翻译:使用铁路图象自动探测主要铁路部件有助于确保铁路运输安全。在本文件中,我们建议建立一个关注力深革命网络(AttnConv-net),以探测包括铁路、弹夹和螺栓在内的多个铁路部件。拟议方法包括一个深革命神经网络(DCNN),作为主干线、导流关注区(CAB)和两个前方网络(FFN)),以自动探测主要铁路部件。使用两种定位嵌入方式来丰富从骨干提取的潜质特征中的信息。根据加工的潜在特征,CAB旨在了解铁路部件的当地背景,包括其类别和组成部分边界。最后类别和捆绑盒是通过平行执行的两个FFFFFN生成的。为了加强对小部件的探测,在培训过程中采用了各种数据增强方法。拟议的AttnCon-net的有效性通过一个真实的数据集和另一个合成数据集得到验证。与经典的光子神经网络基于方法相比,我们拟议的方法简化了探测管道,通过消除基于前和后处理的部件需要,从而能够更快和更快地实现新的铁路图像质量的解决办法。