Automatic smoky vehicle detection in videos is a superior solution to the traditional expensive remote sensing one with ultraviolet-infrared light devices for environmental protection agencies. However, it is challenging to distinguish vehicle smoke from shadow and wet regions coming from rear vehicle or clutter roads, and could be worse due to limited annotated data. In this paper, we first introduce a real-world large-scale smoky vehicle dataset with 75,000 annotated smoky vehicle images, facilitating the effective training of advanced deep learning models. To enable fair algorithm comparison, we also build a smoky vehicle video dataset including 163 long videos with segment-level annotations. Moreover, we present a new Coarse-to-fine Deep Smoky vehicle detection (CoDeS) framework for efficient smoky vehicle detection. The CoDeS first leverages a light-weight YOLO detector for fast smoke detection with high recall rate, and then applies a smoke-vehicle matching strategy to eliminate non-vehicle smoke, and finally uses a elaborately-designed 3D model to further refine the results in spatial temporal space. Extensive experiments in four metrics demonstrate that our framework is significantly superior to those hand-crafted feature based methods and recent advanced methods. The code and dataset will be released at https://github.com/pengxj/smokyvehicle.
翻译:视频中的自动烟雾车辆探测是传统昂贵的遥感方法的更好解决办法,其中为环境保护机构提供了紫外红外光灯设备;然而,将车辆烟雾与来自后车或杂乱道路的阴暗和湿区域区分开来是具有挑战性的,而且由于附加说明的数据有限,情况可能更糟。在本文件中,我们首先推出一个具有75 000个附加说明的烟雾车辆图像的真实世界大规模烟雾车辆数据集,便利对先进的深层学习模型进行有效培训。为了能够进行公平的算法比较,我们还建立了一个烟雾车辆录像数据集,包括163个长的带分级说明的视频。此外,我们提出了一个新的用于高效烟雾车辆探测的Coarse-fine Deep Smoky车辆探测框架。CoDeS首先利用一个轻型的YOLO探测器,以高回想率快速检测烟雾,然后采用一种烟雾与车辆匹配战略,最后使用精心设计的3D模型,以进一步改进空间时空空间空间空间空间空间空间的结果。在四度上进行广泛的实验,四度上进行广泛的实验,表明我们的框架将大大优于这些先进方法。