Neural Networks and Decision Trees: two popular techniques for supervised learning that are seemingly disconnected in their formulation and optimization method, have recently been combined in a single construct. The connection pivots on assembling an artificial Neural Network with nodes that allow for a gate-like function to mimic a tree split, optimized using the standard approach of recursively applying the chain rule to update its parameters. Yet two main challenges have impeded wide use of this hybrid approach: (a) the inability of global gradient ascent techniques to optimize hierarchical parameters (as introduced by the gate function); and (b) the construction of the tree structure, which has relied on standard decision tree algorithms to learn the network topology or incrementally (and heuristically) searching the space at random. Here we propose a probabilistic construct that exploits the idea of a node's unexplained potential (the total error channeled through the node) in order to decide where to expand further, mimicking the standard tree construction in a Neural Network setting, alongside a modified gradient ascent that first locally optimizes an expanded node before a global optimization. The probabilistic approach allows us to evaluate each new split as a ratio of likelihoods that balances the statistical improvement in explaining the evidence against the additional model complexity --- thus providing a natural stopping condition. The result is a novel classification and regression technique that leverages the strength of both: a tree-structure that grows naturally and is simple to interpret with the plasticity of Neural Networks that allow for soft margins and slanted boundaries.
翻译:神经网络和决定树 神经网络和决定树 。 但两大挑战妨碍了广泛使用这种混合方法:(a) 全球梯度升温技术无法优化等级参数(如门边距功能所引入的);(b) 树结构的构建,该结构依靠标准决策树算法来学习网络结构表层学或增量(和超度)搜索空间随机的节点。在这里,我们提出一种概率结构,利用节点解释潜力的理念(通过节点引导的总误差),以便决定进一步扩展的位置,在神经网络中模拟标准树的构建,同时修改梯度,以显示在进行全球优化之前首先在当地优化简单的节点树结构。因此,我们提出了一种概率性结构,以便利用节点解释潜力(通过节点引导的总差错)的理念,以便决定进一步扩展,将标准树的树结构建在神经网络中进行模拟,同时修改梯度显示,在进行全球优化之前,首先在当地优化一个扩大节点的树结构,然后逐步(和超度)地) 学习网络结构。因此,一个自然结构分析结果可以让我们评估如何改变。