In many evolutionary computation systems, parent selection methods can affect, among other things, convergence to a solution. In this paper, we present a study comparing the role of two commonly used parent selection methods in evolving machine learning pipelines in an automated machine learning system called Tree-based Pipeline Optimization Tool (TPOT). Specifically, we demonstrate, using experiments on multiple datasets, that lexicase selection leads to significantly faster convergence as compared to NSGA-II in TPOT. We also compare the exploration of parts of the search space by these selection methods using a trie data structure that contains information about the pipelines explored in a particular run.
翻译:在许多进化计算系统中,母体选择方法除其他外,会影响与解决方案的趋同。在本文中,我们提出一项研究,比较两种常用母体选择方法在自动机器学习系统“植树管优化工具(TPOT ) ” 中不断发展的机器学习管道中的作用。具体地说,我们通过在多个数据集上进行实验,证明与TPOT中的NSGA-II相比,单体选择导致大大加快了趋同速度。我们还利用含有特定运行中所探索管道信息的三角数据结构,比较了这些选择方法对搜索空间部分的探索。