解释用于高多维数据集的黑箱机器学习模型 (Interpreting Black-box Machine Learning Models for High Dimensional Datasets)

Deep neural networks (DNNs) have been shown to outperform traditional machine learning algorithms in a broad variety of application domains due to their effectiveness in modeling intricate problems and handling high-dimensional datasets. Many real-life datasets, however, are of increasingly high dimensionality, where a large number of features may be irrelevant to the task at hand. The inclusion of such features would not only introduce unwanted noise but also increase computational complexity. Furthermore, due to high non-linearity and dependency among a large number of features, DNN models tend to be unavoidably opaque and perceived as black-box methods because of their not well-understood internal functioning. A well-interpretable model can identify statistically significant features and explain the way they affect the model's outcome. In this paper, we propose an efficient method to improve the interpretability of black-box models for classification tasks in the case of high-dimensional datasets. To this end, we first train a black-box model on a high-dimensional dataset to learn the embeddings on which the classification is performed. To decompose the inner working principles of the black-box model and to identify top-k important features, we employ different probing and perturbing techniques. We then approximate the behavior of the black-box model by means of an interpretable surrogate model on the top-k feature space. Finally, we derive decision rules and local explanations from the surrogate model to explain individual decisions. Our approach outperforms and competes with state-of-the-art methods such as TabNet, XGboost, and SHAP-based interpretability techniques when tested on different datasets with varying dimensionality between 50 and 20,000.

翻译：深神经网络( DNNS) 显示在广泛的应用领域,由于在模拟复杂问题和处理高维数据集方面的有效性,DNN模型往往不易避免地不透明,并被视为黑箱方法。许多真实的数据集具有越来越高的维度,其中许多特征可能与手头的任务无关。加入这些特性不仅会引入不必要的噪音,而且会增加计算复杂性。此外,由于许多特性之间高度的非线性和依赖性,DNN模型往往在模型复杂问题和处理高维数据集方面的效力,因此在广泛的应用领域比传统的机器学习算法要强得多。一个良好的互换模型可以识别具有统计意义的特征,并解释它们如何影响模型的结果。在本文中,我们提出了一个有效的方法来改进黑箱模型在高维数据集方面对分类任务的解释性。我们首先将黑箱模型的模型模型和黑箱方法放在高端数据库中,然后将黑箱方法用于学习进行分类的嵌入式,然后用我们从黑箱的黑箱规则来解释。我们用一个重要的黑箱模型来解释,然后用不同的黑箱模型来解释,我们用来解释,然后从黑箱模型和黑箱方法来解释。

相关内容

黑盒

关注 1

在科学，计算和工程学中，黑盒是一种设备，系统或对象，可以根据其输入和输出（或传输特性）对其进行查看，而无需对其内部工作有任何了解。它的实现是“不透明的”（黑色）。几乎任何事物都可以被称为黑盒：晶体管，引擎，算法，人脑，机构或政府。为了使用典型的“黑匣子方法”来分析建模为开放系统的事物，仅考虑刺激/响应的行为，以推断（未知）盒子。该黑匣子系统的通常表示形式是在该方框中居中的数据流程图。黑盒的对立面是一个内部组件或逻辑可用于检查的系统，通常将其称为白盒（有时也称为“透明盒”或“玻璃盒”）。