Ensemble learning has gained success in machine learning with major advantages over other learning methods. Bagging is a prominent ensemble learning method that creates subgroups of data, known as bags, that are trained by individual machine learning methods such as decision trees. Random forest is a prominent example of bagging with additional features in the learning process. Evolutionary algorithms have been prominent for optimisation problems and also been used for machine learning. Evolutionary algorithms are gradient-free methods that work with a population of candidate solutions that maintain diversity for creating new solutions. In conventional bagged ensemble learning, the bags are created once and the content, in terms of the training examples, are fixed over the learning process. In our paper, we propose evolutionary bagged ensemble learning, where we utilise evolutionary algorithms to evolve the content of the bags in order to iteratively enhance the ensemble by providing diversity in the bags. The results show that our evolutionary ensemble bagging method outperforms conventional ensemble methods (bagging and random forests) for several benchmark datasets under certain constraints. We find that evolutionary bagging can inherently sustain a diverse set of bags without reduction in performance accuracy.
翻译:与其它学习方法相比,综合学习在机器学习中取得了成功,比其他学习方法大有优势。 粘贴是一种突出的混合学习方法,它创造了数据分组,称为袋,由单机学习方法培训,如决策树等。 随机森林是加袋的突出例子,在学习过程中具有更多的特征。 进化算法对于优化问题十分突出,也用于机器学习。 进化算法是无梯度的方法,它与一批维持多样性的候选解决方案一起工作,以创造新的解决方案。 在常规的袋式混合学习中,包包被创建了一次,而内容(培训实例)在学习过程中固定下来。 在我们的论文中,我们提出进化加袋组合学习,我们利用进化算法来演化袋的内容,以便通过提供袋中的多样性来迭接地加强组合。 演化算法表明,我们进化的全套套方法超越了在一定制约下若干基准数据集的常规混合方法(拖动和随机森林)。 我们发现,进化袋的精确性可以维持一套不同的组合。