The ability to forecast the concentration of air pollutants in an urban region is crucial for decision-makers wishing to reduce the impact of pollution on public health through active measures (e.g. temporary traffic closures). In this study, we present a machine learning approach applied to the forecast of the day-ahead maximum value of the ozone concentration for several geographical locations in southern Switzerland. Due to the low density of measurement stations and to the complex orography of the use case terrain, we adopted feature selection methods instead of explicitly restricting relevant features to a neighbourhood of the prediction sites, as common in spatio-temporal forecasting methods. We then used Shapley values to assess the explainability of the learned models in terms of feature importance and feature interactions in relation to ozone predictions; our analysis suggests that the trained models effectively learned explanatory cross-dependencies among atmospheric variables. Finally, we show how weighting observations helps in increasing the accuracy of the forecasts for specific ranges of ozone's daily peak values.
翻译:对希望通过积极措施(例如临时交通封锁)减少污染对公众健康的影响的决策者来说,预测城市地区空气污染物浓度的能力至关重要。在本研究中,我们介绍了一种机器学习方法,用于预测瑞士南部若干地理区域臭氧浓度的日头最大值。由于测量站密度低以及使用案例地形复杂或地形复杂,我们采用了特征选择方法,而不是将相关特征明确限制在预测地点附近,这是时空预报方法中常见的特征。我们随后使用“毛片值”评估所学模型在臭氧预测方面特性重要性和特征相互作用的解释性;我们的分析表明,经过培训的模型有效地学习了大气变量之间相互依赖的解释性。最后,我们展示了加权观测如何有助于提高对特定臭氧日峰值范围的预测的准确性。