Random Forests and related tree-based methods are popular for supervised learning from table based data. Apart from their ease of parallelization, their classification performance is also superior. However, this performance, especially parallelizability, is offset by the loss of explainability. Statistical methods are often used to compensate for this disadvantage. Yet, their ability for local explanations, and in particular for global explanations, is limited. In the present work we propose an algebraic method, rooted in lattice theory, for the (global) explanation of tree ensembles. In detail, we introduce two novel conceptual views on tree ensemble classifiers and demonstrate their explanatory capabilities on Random Forests that were trained with standard parameters.
翻译:随机森林和相关基于树木的方法在从基于表格的数据中监督学习方面很受欢迎。除了容易平行化之外,它们的分类性能也比较优异。但是,这种业绩,特别是平行性,被解释性损失所抵消。统计方法往往用来弥补这一不利之处。然而,它们在当地解释的能力,特别是全球解释的能力是有限的。在目前的工作中,我们为(全球)解释树木集合提出了一种代数法,该代数法植根于拉蒂斯理论。我们详细介绍了两种关于树群分类的新的概念性观点,并展示了它们在经过标准参数培训的随机森林上的解释能力。