Traffic accidents have been a severe issue in metropolises with the development of traffic flow. This paper explores the theory and application of a recently developed machine learning technique, namely Import Vector Machines (IVMs), in real-time crash risk analysis, which is a hot topic to reduce traffic accidents. Historical crash data and corresponding traffic data from Shanghai Urban Expressway System were employed and matched. Traffic conditions are labelled as dangerous (i.e. probably leading to a crash) and safe (i.e. a normal traffic condition) based on 5-minute measurements of average speed, volume and occupancy. The IVM algorithm is trained to build the classifier and its performance is compared to the popular and successfully applied technique of Support Vector Machines (SVMs). The main findings indicate that IVMs could successfully be employed in real-time identification of dangerous pro-active traffic conditions. Furthermore, similar to the "support points" of the SVM, the IVM model uses only a fraction of the training data to index kernel basis functions, typically a much smaller fraction than the SVM, and its classification rates are similar to those of SVMs. This gives the IVM a computational advantage over the SVM, especially when the size of the training data set is large.
翻译:由于交通流量的发展,交通事故在地铁交通中是一个严重问题。本文件探讨了最近开发的机器学习技术(即进口病媒机(IVMs)的理论和应用,即进口病媒机(IVMs)的实时碰撞风险分析,这是一个减少交通事故的热题。使用和匹配了上海城市高速公路系统的历史坠机数据和相应的交通数据。交通条件被贴上危险(即可能导致坠毁)和安全(即正常交通条件)的标签,根据平均速度、体积和占用的5分钟测量结果。IVM算法是用来建造分类器的,其性能与流行和成功应用的支持病媒机(SVMs)技术相比较。主要调查结果表明,IVM可成功用于实时识别危险的主动交通条件。此外,与SVM的“支持点”相似,IVM模型仅使用培训数据的一部分用于索引内核功能,通常比SVM少得多,其分类率与SVMs的分类率相似。这给SVM的大规模测试带来了IVM的优势。