We focus on modeling the relationship between an input feature vector and the predicted outcome of a trained decision tree using mixed-integer optimization. This can be used in many practical applications where a decision tree or tree ensemble is incorporated into an optimization problem to model the predicted outcomes of a decision. We propose tighter mixed-integer optimization formulations than those previously introduced. Existing formulations can be shown to have linear relaxations that have fractional extreme points, even for the simple case of modeling a single decision tree. A formulation we propose, based on a projected union of polyhedra approach, is ideal for a single decision tree. While the formulation is generally not ideal for tree ensembles or if additional constraints are added, it generally has fewer extreme points, leading to a faster time to solve, particularly if the formulation has relatively few trees. However, previous work has shown that formulations based on a binary representation of the feature vector perform well computationally and hence are attractive for use in practical applications. We present multiple approaches to tighten existing formulations with binary vectors, and show that fractional extreme points are removed when there are multiple splits on the same feature. At an extreme, we prove that this results in ideal formulations for tree ensembles modeling a one-dimensional feature vector. Building on this result, we also show via numerical simulations that these additional constraints result in significantly tighter linear relaxations when the feature vector is low dimensional. We also present instances where the time to solve to optimality is significantly improved using these formulations.
翻译:我们的重点是模拟输入特性矢量与使用混合整数优化的经过培训的决策树的预测结果之间的关系。 这可用于许多实际应用中, 将决策树或树集合纳入优化问题, 以模拟某项决定的预测结果。 我们建议采用混合整数优化, 以混合整数优化方式建模。 我们提议了一种配方, 以预测的多角度结合方式建模, 这对于单一决定树是理想的。 虽然在很多实际应用中, 将决策树或树团合体纳入优化问题, 以模拟某项决定的预测结果。 我们提议了更为严格的混合整数优化的配方, 但通常没有那么极端的点, 导致更快的解算时间, 特别是如果配方的树木相对较少。 但是, 现有的配方显示, 基于特性矢量二进制的配方在计算上效果良好, 因此在实际应用中很有吸引力。 我们提出了多种方法, 将现有的配方与二进量矢量结合, 并表明当树团组合中存在多种分数极端点时, 当树团的精度配值是多分制时,, 也是通过一个理想的直径模型显示, 在一个模型中, 一种模型中, 我们通过一个模型中, 一种极端的制成为一个模型显示一个模型的比成为一种最接近。</s>