Existing ordinal trees and random forests typically use scores that are assigned to the ordered categories, which implies that a higher scale level is used. Versions of ordinal trees are proposed that take the scale level seriously and avoid the assignment of artificial scores. The basic construction principle is based on an investigation of the binary models that are implicitly used in parametric ordinal regression. These building blocks can be fitted by trees and combined in a similar way as in parametric models. The obtained trees use the ordinal scale level only. Since binary trees and random forests are constituent elements of the trees one can exploit the wide range of binary trees that have already been developed. A further topic is the potentially poor performance of random forests, which seems to have ignored in the literature. Ensembles that include parametric models are proposed to obtain prediction methods that tend to perform well in a wide range of settings. The performance of the methods is evaluated empirically by using several data sets.
翻译:现有的圆形树木和随机森林通常使用分配给定购类别的分数,这意味着使用更高的比例水平。建议了正正方形树木的版本,认真对待尺度水平,避免分配人为分数。基本建筑原则的基础是调查在参数或分数回归中暗含使用的二进制模型。这些建筑块可以由树木安装,并用与参数模型类似的方式结合。获得的树木仅使用正方形尺度水平。由于二进制树木和随机森林是树木的组成部分,因此可以利用已经开发的多种二进制树。另一个主题是随机森林的性能可能很差,文献中似乎忽视了这种情况。建议包含参数模型的集合,以获得往往在广泛环境中运行良好的预测方法。通过使用若干数据集对方法的性能进行实验性评估。