A system of nested dichotomies is a method of decomposing a multi-class problem into a collection of binary problems. Such a system recursively applies binary splits to divide the set of classes into two subsets, and trains a binary classifier for each split. Many methods have been proposed to perform this split, each with various advantages and disadvantages. In this paper, we present a simple, general method for improving the predictive performance of nested dichotomies produced by any subset selection techniques that employ randomness to construct the subsets. We provide a theoretical expectation for performance improvements, as well as empirical results showing that our method improves the root mean squared error of nested dichotomies, regardless of whether they are employed as an individual model or in an ensemble setting.
翻译:嵌入二进制二进制二进制系统是一种将多级问题分解成二进制问题集的方法。 这样的系统会循环地应用二进制分解法将分类组分成两个子集, 并训练每个分解的二进制分类器。 已经提出了许多方法来进行这种分解, 每个分解都有各种优缺点。 在本文中, 我们提出了一个简单、 通用的方法来改进由使用随机性构建子集集的子集选择技术生成的嵌入二进制二进制二进制二进制的预测性能。 我们对性能的改进提供理论期望, 以及实验结果显示我们的方法改善了嵌入二进制二进制的根正方位错误, 不论它们是作为单个模型还是作为共合体设置使用。