In this work we introduce an implementation for which machine learning techniques helped improve the overall performance of an evolutionary algorithm for an optimization problem, namely a variation of robust minimum-cost path in graphs. In this big data optimization problem, a path achieving a good cost in most scenarios from an available set of scenarios (generated by a simulation process) must be obtained. The most expensive task of our evolutionary algorithm, in terms of computational resources, is the evaluation of candidate paths: the fitness function must calculate the cost of the candidate path in every generated scenario. Given the large number of scenarios, this task must be implemented in a distributed environment. We implemented gradient boosting decision trees to classify candidate paths in order to identify good candidates. The cost of the not-so-good candidates is simply forecasted. We studied the training process, gain performance, accuracy, and other variables. Our computational experiments show that the computational performance was significantly improved at the expense of a limited loss of accuracy.
翻译:在这项工作中,我们引入了一种执行方法,即机器学习技术帮助改进了优化问题演进算法的总体性能,即改变图表中稳健的最低限度成本路径。在这种大数据优化问题中,必须从一套可用的假设情景(模拟过程产生的)中找到一条在多数假设情景中实现良好成本的途径。我们演进算法在计算资源方面最昂贵的任务是评估候选路径:健身功能必须计算每个生成的情景中候选路径的成本。考虑到众多的情景,这项任务必须在分布式环境中执行。我们实施了梯度加速决策树来对候选路径进行分类,以便确定好的候选人。只对不优秀的候选人的成本作出预测。我们研究了培训过程、提高性能、准确性和其他变量。我们的计算实验表明,计算性表现大大改进了,而牺牲了有限的准确性损失。