The precise estimation of resource usage is a complex and challenging issue due to the high variability and dimensionality of heterogeneous service types and dynamic workloads. Over the last few years, the prediction of resource usage and traffic has received ample attention from the research community. Many machine learning-based workload forecasting models have been developed by exploiting their computational power and learning capabilities. This paper presents the first systematic survey cum performance analysis-based comparative study of diversified machine learning-driven cloud workload prediction models. The discussion initiates with the significance of predictive resource management followed by a schematic description, operational design, motivation, and challenges concerning these workload prediction models. Classification and taxonomy of different prediction approaches into five distinct categories are presented focusing on the theoretical concepts and mathematical functioning of the existing state-of-the-art workload prediction methods. The most prominent prediction approaches belonging to a distinct class of machine learning models are thoroughly surveyed and compared. All five classified machine learning-based workload prediction models are implemented on a common platform for systematic investigation and comparison using three distinct benchmark cloud workload traces via experimental analysis. The essential key performance indicators of state-of-the-art approaches are evaluated for comparison and the paper is concluded by discussing the trade-offs and notable remarks.
翻译:由于多种服务类型和动态工作量的多变性和多面性,对资源使用情况的精确估计是一个复杂和具有挑战性的问题。过去几年来,对资源使用和流量的预测得到了研究界的足够重视。许多基于机械学习的工作量预测模型是通过利用计算能力和学习能力开发的。本文件介绍了对多种机器学习驱动云工作量预测模型的首次系统调查和基于业绩分析的比较研究。讨论的出发点是预测资源管理的重要性,然后是预测性描述、业务设计、动机和关于这些工作量预测模型的挑战。对五种不同类别的不同预测方法的分类和分类,重点是现有最新工作量预测方法的理论概念和数学功能。属于不同类型机算模型的最突出的预测方法经过彻底调查和比较。所有五个基于机学的分类工作量预测模型都是在一个共同的平台上实施的,通过实验分析,利用三种不同的基准云工作量痕迹进行系统调查和比较。对最新方法的基本关键业绩指标进行了评估,以供比较,并通过讨论贸易观点和显著的论文来完成。