We transfer distances on clusterings to the building process of decision trees, and as a consequence extend the classical ID3 algorithm to perform modifications based on the global distance of the tree to the ground truth--instead of considering single leaves. Next, we evaluate this idea in comparison with the original version and discuss occurring problems, but also strengths of the global approach. On this basis, we finish by identifying other scenarios where global evaluations are worthwhile.
翻译:我们把集群的距离转移到决策树的建设过程中,因此,我们扩大了传统的ID3算法,根据树的全球距离对地面进行修改,而不是考虑单叶。 其次,我们比照原始版本来评估这一想法,并讨论正在发生的问题,但也讨论全球方法的长处。 在此基础上,我们通过确定其他值得进行全球评估的情景来完成这项工作。