We show how neural models can be used to realize piecewise constant functions such as decision trees. The proposed architecture, which we call locally constant networks, builds on ReLU networks, which are piecewise linear and hence have gradients with respect to the input that are locally constant. We formally establish the equivalence between the classes of locally constant networks and decision trees. Moreover, we highlight several advantageous properties of locally constant networks, including how they realize decision trees with parameter sharing across branching nodes and leaves. Indeed, only $M$ neurons suffice to implicitly model an oblique decision tree with $2^M$ leaf nodes. The neural representation also enables us to adopt many tools developed for deep networks (e.g., DropConnect (Wan et al., 2013)) while implicitly training decision trees. We demonstrate that our method outperforms alternative techniques for training oblique decision trees in the context of molecular property classification and regression tasks.
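To make the correspondence concrete, here is a minimal NumPy sketch (an illustration, not the paper's implementation) of the core observation: the on/off activation pattern of $M$ ReLU neurons selects one of up to $2^M$ linear regions, analogous to the leaves of an oblique decision tree, and within each region the network's gradient with respect to the input is constant. All variable names below are hypothetical.

```python
import numpy as np

# Minimal sketch: M ReLU neurons partition the input space into at
# most 2^M linear regions. The binary activation pattern plays the
# role of a root-to-leaf path in an implicit oblique decision tree.
rng = np.random.default_rng(0)
M, d = 3, 5                        # 3 neurons -> up to 2^3 = 8 leaves
W = rng.normal(size=(M, d))        # oblique split directions
b = rng.normal(size=M)             # split offsets
v = rng.normal(size=M)             # output-layer weights

def leaf_and_gradient(x):
    pre = W @ x + b
    pattern = (pre > 0).astype(int)            # on/off state of each ReLU
    leaf = int("".join(map(str, pattern)), 2)  # index among the 2^M leaves
    # Gradient of v . relu(Wx + b) w.r.t. x; constant within a region.
    grad = (pattern * v) @ W
    return leaf, grad

x = rng.normal(size=d)
leaf, grad = leaf_and_gradient(x)
print(leaf, grad)
# Nearby inputs with the same activation pattern share this gradient,
# which is the locally constant function the architecture exploits.
```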