Flight delays impose challenges that impact any flight transportation system. Predicting when they are going to occur is an important way to mitigate this issue. However, the behavior of the flight delay system varies through time. This phenomenon is known in predictive analytics as concept drift. This paper investigates the prediction performance of different drift handling strategies in aviation under different scales (models trained from flights related to a single airport or the entire flight system). Specifically, two research questions were proposed and answered: (i) How do drift handling strategies influence the prediction performance of delays? (ii) Do different scales change the results of drift handling strategies? In our analysis, drift handling strategies are relevant, and their impacts vary according to scale and machine learning models used.
翻译:飞行延误带来了影响任何飞行运输系统的挑战。预测何时会发生,是缓解这一问题的重要途径。不过,飞行延迟系统的行为因时间而异。这一现象在预测分析中被称为概念漂移。本文调查不同规模(从单一机场或整个飞行系统相关航班培训的模型)航空中不同漂移处理战略的预测性能。具体地说,提出并回答了两个研究问题:(一) 漂移处理战略如何影响延迟预测性能? (二) 不同规模是否改变漂移处理战略的结果?在我们的分析中,漂移处理战略是相关的,其影响因所使用的规模和机器学习模式而不同。