Ensemble trees are a popular machine learning model which often yields high prediction performance when analysing structured data. Although individual small decision trees are deemed explainable by nature, an ensemble of large trees is often difficult to understand. In this work, we propose an approach called optimised explanation (OptExplain) that faithfully extracts global explanations of ensemble trees using a combination of logical reasoning, sampling and optimisation. Building on top of this, we propose a method called the profile of equivalent classes (ProClass), which uses MAX-SAT to simplify the explanation even further. Our experimental study on several datasets shows that our approach can provide high-quality explanations to large ensemble trees models, and it betters recent top-performers.
翻译:集合树木是一种流行的机器学习模型,在分析结构化数据时往往能产生高预测性效果。虽然个体小决策树被视为自然界可以解释,但许多大树往往难以理解。在这项工作中,我们提出了一个称为优化解释(OptExplain)的方法,它利用逻辑推理、采样和优化的结合,忠实地提取对组合树木的全球解释。在此基础上,我们提出了一种称为等同类别(ProClasss)配置的方法,它使用MAX-SAT来进一步简化解释。 我们对几个数据集的实验研究表明,我们的方法可以向大型组合树木模型提供高质量的解释,并且改进了最近的顶层模型。