Data-driven forecasts of air quality have recently achieved more accurate short-term predictions. Despite their success, most of the current data-driven solutions lack proper quantifications of model uncertainty that communicate how much to trust the forecasts. Recently, several practical tools to estimate uncertainty have been developed in probabilistic deep learning. However, there have not been empirical applications and extensive comparisons of these tools in the domain of air quality forecasts. Therefore, this work applies state-of-the-art techniques of uncertainty quantification in a real-world setting of air quality forecasts. Through extensive experiments, we describe training probabilistic models and evaluate their predictive uncertainties based on empirical performance, reliability of confidence estimate, and practical applicability. We also propose improving these models using "free" adversarial training and exploiting temporal and spatial correlation inherent in air quality data. Our experiments demonstrate that the proposed models perform better than previous works in quantifying uncertainty in data-driven air quality forecasts. Overall, Bayesian neural networks provide a more reliable uncertainty estimate but can be challenging to implement and scale. Other scalable methods, such as deep ensemble, Monte Carlo (MC) dropout, and stochastic weight averaging-Gaussian (SWAG), can perform well if applied correctly but with different tradeoffs and slight variations in performance metrics. Finally, our results show the practical impact of uncertainty estimation and demonstrate that, indeed, probabilistic models are more suitable for making informed decisions. Code and dataset are available at \url{https://github.com/Abdulmajid-Murad/deep_probabilistic_forecast}
翻译:由数据驱动的空气质量预测最近取得了更准确的短期预测。尽管取得了成功,但目前大多数由数据驱动的解决方案都缺乏适当的模型不确定性量化,无法说明如何相信预测。最近,在概率深厚的深层次学习中开发了几种实际工具来估计不确定性。然而,在空气质量预测领域,没有经验应用和对这些工具的广泛比较。因此,这项工作在现实世界空气质量预测环境中采用最先进的不确定性量化技术。通过广泛的实验,我们描述培训稳定模型,并根据经验性、信心估计的可靠性和实际适用性来评估其预测不确定性。我们还提议利用“免费”对抗性培训来改进这些模型,并利用空气质量数据中固有的时间和空间相关性。我们的实验表明,在量化数据驱动空气质量预测中的不确定性方面,拟议的模型比以前的工作表现得更好。总体而言,贝亚神经网络提供了更可靠的不确定性估算,但对于实施和规模而言具有挑战性。其他可变的方法,如深内容可读性、蒙特卡洛(MC)的辍学和准确性估算,最终的准确性能显示我们的实际性评估结果。