MonoKAN：经认证的单调Kolmogorov-Arnold网络 (MonoKAN: Certified Monotonic Kolmogorov-Arnold Network)

Artificial Neural Networks (ANNs) have significantly advanced various fields by effectively recognizing patterns and solving complex problems. Despite these advancements, their interpretability remains a critical challenge, especially in applications where transparency and accountability are essential. To address this, explainable AI (XAI) has made progress in demystifying ANNs, yet interpretability alone is often insufficient. In certain applications, model predictions must align with expert-imposed requirements, sometimes exemplified by partial monotonicity constraints. While monotonic approaches are found in the literature for traditional Multi-layer Perceptrons (MLPs), they still face difficulties in achieving both interpretability and certified partial monotonicity. Recently, the Kolmogorov-Arnold Network (KAN) architecture, based on learnable activation functions parametrized as splines, has been proposed as a more interpretable alternative to MLPs. Building on this, we introduce a novel ANN architecture called MonoKAN, which is based on the KAN architecture and achieves certified partial monotonicity while enhancing interpretability. To achieve this, we employ cubic Hermite splines, which guarantee monotonicity through a set of straightforward conditions. Additionally, by using positive weights in the linear combinations of these splines, we ensure that the network preserves the monotonic relationships between input and output. Our experiments demonstrate that MonoKAN not only enhances interpretability but also improves predictive performance across the majority of benchmarks, outperforming state-of-the-art monotonic MLP approaches.

翻译：人工神经网络（ANNs）通过有效识别模式与解决复杂问题，显著推动了多个领域的进展。尽管取得了这些进步，其可解释性仍是一个关键挑战，尤其在需要透明度和可问责性的应用中。为应对此问题，可解释人工智能（XAI）在揭示ANN内部机制方面取得进展，但仅凭可解释性往往不足。在某些应用中，模型预测必须符合专家设定的要求，有时体现为部分单调性约束。虽然文献中已针对传统多层感知机（MLPs）提出了单调性方法，但这些方法在同时实现可解释性与经认证的部分单调性方面仍面临困难。近期，基于可学习激活函数（参数化为样条）的Kolmogorov-Arnold网络（KAN）架构被提出，作为MLPs更具可解释性的替代方案。在此基础上，我们提出一种名为MonoKAN的新型ANN架构，它基于KAN架构，在增强可解释性的同时实现了经认证的部分单调性。为实现这一目标，我们采用三次Hermite样条，其通过一组简明条件保证了单调性。此外，通过在这些样条的线性组合中使用正权重，我们确保网络保持了输入与输出间的单调关系。实验表明，MonoKAN不仅提升了可解释性，还在大多数基准测试中改善了预测性能，优于当前最先进的单调MLP方法。

相关内容

可解释性

关注 81

广义上的可解释性指在我们需要了解或解决一件事情的时候，我们可以获得我们所需要的足够的可以理解的信息，也就是说一个人能够持续预测模型结果的程度。按照可解释性方法进行的过程进行划分的话，大概可以划分为三个大类：在建模之前的可解释性方法，建立本身具备可解释性的模型，在建模之后使用可解释性方法对模型作出解释。

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日