A neural network with the widely-used ReLU activation has been shown to partition the sample space into many convex polytopes for prediction. However, the parameterized way a neural network and other machine learning models use to partition the space has imperfections, \textit{e}.\textit{g}., the compromised interpretability for complex models, the inflexibility in decision boundary construction due to the generic character of the model, and the risk of being trapped into shortcut solutions. In contrast, although the non-parameterized models can adorably avoid or downplay these issues, they are usually insufficiently powerful either due to over-simplification or the failure to accommodate the manifold structures of data. In this context, we first propose a new type of machine learning models referred to as Manifoldron that directly derives decision boundaries from data and partitions the space via manifold structure discovery. Then, we systematically analyze the key characteristics of the Manifoldron such as manifold characterization capability and its link to neural networks. The experimental results on 4 synthetic examples, 20 public benchmark datasets, and 1 real-world application demonstrate that the proposed Manifoldron performs competitively compared to the mainstream machine learning models. We have shared our code in \url{https://github.com/wdayang/Manifoldron} for free download and evaluation.
翻译:以广泛使用的 ReLU 激活的神经网络已经显示,可以将样本空间分割成许多 convex 多元桌面进行预测。然而,用于分割空间的神经网络和其他机器学习模型的参数化方法有不完善之处,即:\textit{e}.\textit{g}。 复杂模型的可解释性受损,由于模型的通用性质而决定边界构造的不灵活,以及被困在捷径解决方案中的风险。相比之下,虽然非参数化模型可以很好地避免或淡化这些问题,但由于过度简化或无法容纳数据多重结构,这些模型通常不够强大。在此情况下,我们首先提出一种新型机器学习模型,称为Maniterron,直接从数据中获取决定界限,通过多维结构发现空间分隔。然后,我们系统地分析该模型的关键特征,例如多重特征描述能力及其与神经网络的链接。关于4个合成示例的实验结果,20个公共基准数据集,以及1个真实世界的竞争性应用软件,显示我们在MAN 和We- dirmal-drob 上进行对比的模拟学习。