The detection and classification of vehicles on the road is a crucial task for traffic monitoring. Usually, Computer Vision (CV) algorithms dominate the task of vehicle classification on the road, but CV methodologies might suffer in poor lighting conditions and require greater amounts of computational power. Additionally, there is a privacy concern with installing cameras in sensitive and secure areas. In contrast, acoustic traffic monitoring is cost-effective, and can provide greater accuracy, particularly in low lighting conditions and in places where cameras cannot be installed. In this paper, we consider the task of acoustic vehicle sub-type classification, where we classify acoustic signals into 4 classes: car, truck, bike, and no vehicle. We experimented with Mel spectrograms, MFCC and GFCC as features and performed data pre-processing to train a simple, well optimized CNN that performs well at the task. When used with MFCC as features and careful data pre-processing, our proposed methodology improves upon the established state-of-the-art baseline on the IDMT Traffic dataset with an accuracy of 98.95%.
翻译:对公路上车辆的探测和分类是交通监测的一项关键任务,通常,计算机视像算法在公路上控制车辆分类任务,但CV方法在照明条件差的情况下可能受到影响,需要更大的计算能力;此外,在敏感和安全地区安装照相机是隐私问题;相反,音响交通监测具有成本效益,可以提供更大的准确性,特别是在低照明条件下和无法安装照相机的地方;在本文件中,我们考虑了声学车辆分型分类的任务,我们把声学信号分为4类:汽车、卡车、自行车和没有车辆;我们用Mel光谱、MFCC和GFCC作为特征进行了试验,并进行了数据预处理,以训练一个简单、优化的CNN,在执行任务时表现良好;在使用MFCC作为特点和仔细的数据预处理时,我们提出的方法将改进了IDMT交通数据集的既定最新基线,精确率为98.95%。