衡量交通量预测模型信心:技术、实验比较及其可操作性准则 (Measuring the Confidence of Traffic Forecasting Models: Techniques, Experimental Comparison and Guidelines towards Their Actionability)

The estimation of the amount of uncertainty featured by predictive machine learning models has acquired a great momentum in recent years. Uncertainty estimation provides the user with augmented information about the model's confidence in its predicted outcome. Despite the inherent utility of this information for the trustworthiness of the user, there is a thin consensus around the different types of uncertainty that one can gauge in machine learning models and the suitability of different techniques that can be used to quantify the uncertainty of a specific model. This subject is mostly non existent within the traffic modeling domain, even though the measurement of the confidence associated to traffic forecasts can favor significantly their actionability in practical traffic management systems. This work aims to cover this lack of research by reviewing different techniques and metrics of uncertainty available in the literature, and by critically discussing how confidence levels computed for traffic forecasting models can be helpful for researchers and practitioners working in this research area. To shed light with empirical evidence, this critical discussion is further informed by experimental results produced by different uncertainty estimation techniques over real traffic data collected in Madrid (Spain), rendering a general overview of the benefits and caveats of every technique, how they can be compared to each other, and how the measured uncertainty decreases depending on the amount, quality and diversity of data used to produce the forecasts.

翻译：预测机器学习模型所显示的不确定性程度估计近年来的势头很大。不确定的估计为用户提供了关于模型对预测结果的信心的更多信息。尽管这种信息对用户的可信度具有内在的效用,但对于不同种类的不确定性,人们在机器学习模型和可用于量化特定模型不确定性的不同技术的适宜性方面可以衡量,但对于不同类型的不确定性,人们可以对不同类型的不确定性进行细微的共识,而这个主题在交通模型领域大多不存在,尽管对与交通预测有关的信心的衡量可大大促进其在实际交通管理系统中的可操作性。这项工作的目的是通过审查文献中现有的不同技术和不确定性的衡量标准来弥补这种研究的缺乏,并严格讨论计算交通预测模型的信心水平如何有助于在这一研究领域工作的研究人员和从业人员。为了了解经验证据,这一关键讨论还进一步参考了对马德里(西班牙)所收集的实际交通数据的不同不确定性估计技术所产生的实验结果,对每种技术的效益和洞察力都大有帮助,它们如何相互比较,以及根据使用的数据的数量、测量的不确定性如何减少。