Automated algorithm selection and configuration methods that build on exploratory landscape analysis (ELA) are becoming very popular in Evolutionary Computation. However, despite a significantly growing number of applications, the underlying machine learning models are often chosen in an ad-hoc manner. We show in this work that three classical regression methods are able to achieve meaningful results for ELA-based algorithm selection. For those three models -- random forests, decision trees, and bagging decision trees -- the quality of the regression models is highly impacted by the chosen hyper-parameters. This has significant effects also on the quality of the algorithm selectors that are built on top of these regressions. By comparing a total number of 30 different models, each coupled with 2 complementary regression strategies, we derive guidelines for the tuning of the regression models and provide general recommendations for a more systematic use of classical machine learning models in landscape-aware algorithm selection. We point out that a choice of the machine learning model merits to be carefully undertaken and further investigated.
翻译:建立在探索性地貌分析(ELA)基础上的自动算法选择和配置方法在进化计算中正在变得非常流行。然而,尽管应用数量显著增加,基本机器学习模型往往以临时方式选择。我们在工作中表明,三种经典回归方法能够为以拉美经济体系为基础的算法选择取得有意义的结果。对于这三种模型 -- -- 随机森林、决策树和包装式决策树 -- -- 回归模型的质量受到所选超参数的高度影响。这也对在这些回归物之上建立的算法选择器的质量产生了重大影响。通过对总共30种不同的模型进行比较,每个模型加上2种互补回归战略,我们为回归模型的调整制定了指导方针,并为更系统地使用传统的机器学习模型选择地貌图法提供了一般建议。我们指出,选择机器学习模型的优点需要认真研究并进一步研究。