Ensembles based on k nearest neighbours (kNN) combine a large number of base learners, each constructed on a sample taken from a given training data. Typical kNN based ensembles determine the k closest observations in the training data bounded to a test sample point by a spherical region to predict its class. In this paper, a novel random projection extended neighbourhood rule (RPExNRule) ensemble is proposed where bootstrap samples from the given training data are randomly projected into lower dimensions for additional randomness in the base models and to preserve features information. It uses the extended neighbourhood rule (ExNRule) to fit kNN as base learners on randomly projected bootstrap samples.
翻译:基于k最近邻的集成模型结合大量的基础学习者,每个学习者都建立在给定训练数据所提取的样本上。典型的k最近邻基因神经集成模型通过一个球形区域界定训练数据中离测试样本点最近的k个观测值以预测其类别。本文提出了一种新的随机投影扩展邻域规则(RPExNRule)集成模型,其中从给定训练数据中随机投影样本至低维度,以便增加基础学习者的随机性并且保留特征信息。该模型使用扩展邻域规则(ExNRule)来拟合基于k最近邻算法的基础学习者在随机投影的bootstrap样本上。