A neural network with the widely-used ReLU activation has been shown to partition the sample space into many convex polytopes for prediction. However, the parameterized way a neural network and other machine learning models use to partition the space has imperfections, e.g., the compromised interpretability for complex models, the inflexibility in decision boundary construction due to the generic character of the model, and the risk of being trapped into shortcut solutions. In contrast, although the non-parameterized models can adorably avoid or downplay these issues, they are usually insufficiently powerful either due to over-simplification or the failure to accommodate the manifold structures of data. In this context, we first propose a new type of machine learning models referred to as Manifoldron that directly derives decision boundaries from data and partitions the space via manifold structure discovery. Then, we systematically analyze the key characteristics of the Manifoldron including interpretability, manifold characterization capability, and its link to neural networks. The experimental results on 9 small and 11 large datasets demonstrate that the proposed Manifoldron performs competitively compared to the mainstream machine learning models. We have shared our code https://github.com/wdayang/Manifoldron for free download and evaluation.
翻译:已经展示出一个带有广泛使用的ReLU激活功能的神经网络,将样本空间分割成许多 convex 多元顶点以进行预测,然而,神经网络和其他机器学习模型用于分割空间的参数化模式存在不完善之处,例如,复杂模型的可解释性受损,由于模型的通用性而决定边界构造的不灵活,以及被困在捷径解决方案中的风险。相比之下,虽然非参数化模型可以很好地避免或淡化这些问题,但由于过于简单化或无法容纳多种数据结构,这些模型通常不够强大。在此情况下,我们首先提出一种称为Manfoldron的新型机器学习模型,直接从数据中获取决定界限,并通过多元结构发现分割空间。然后,我们系统地分析Manforron的关键特征,包括可解释性、多重定性能力及其与神经网络的联系。9个小型和11个大型数据集的实验结果显示,拟议的Manforterron与主流机器学习模型相比,具有竞争性。我们共享的代码是用于主流机器学习模型的。