Prediction of network traffic behavior is significant for the effective management of modern telecommunication networks. However, the intuitive approach of predicting network traffic using administrative experience and market analysis data is inadequate for an efficient forecast framework. As a result, many different mathematical models have been studied to capture the general trend of the network traffic and predict accordingly. But the comprehensive performance analysis of varying regression models and their ensemble has not been studied before for analyzing real-world anomalous traffic. In this paper, several regression models such as Extra Gradient Boost (XGBoost), Light Gradient Boosting Machine (LightGBM), Stochastic Gradient Descent (SGD), Gradient Boosting Regressor (GBR), and CatBoost Regressor were analyzed to predict real traffic without and with outliers and show the significance of outlier detection in real-world traffic prediction. Also, we showed the outperformance of the ensemble regression model over the individual prediction model. We compared the performance of different regression models based on five different feature sets of lengths 6, 9, 12, 15, and 18. Our ensemble regression model achieved the minimum average gap of 5.04% between actual and predicted traffic with nine outlier-adjusted inputs. In general, our experimental results indicate that the outliers in the data can significantly impact the quality of the prediction. Thus, outlier detection and mitigation assist the regression model in learning the general trend and making better predictions.
翻译:网络交通行为的预测对现代电信网络的有效管理意义重大,然而,利用行政经验和市场分析数据预测网络交通的直观方法不足以形成一个有效的预测框架。因此,对许多不同的数学模型进行了研究,以了解网络交通的总体趋势并作出相应的预测。但是,在分析真实世界异常交通流量之前,尚未对不同回归模型及其组合的全面绩效分析进行过研究。在本文中,若干回归模型的绩效分析模型,如超重波(XGBoost),轻重重重力推车机器(LightGBM),小重力推车趋势梯子(SGD),重力推后推车(GBR)和CatBoost Regrestrator(Cat Boost Regrestor)等综合模型进行了分析,以了解网络交通的全局趋势,并显示在现实世界交通预测中发现异常值。此外,我们比较了基于五种不同特征的不同回归模型的绩效,分别是9号、12号、15号和18号不同的回归模型,从而明显地测量了我们的平均预测结果。