Deep neural networks (DNNs) have been shown to outperform traditional machine learning algorithms in a broad variety of application domains due to their effectiveness in modeling intricate problems and handling high-dimensional datasets. Many real-life datasets, however, are of increasingly high dimensionality, where a large number of features may be irrelevant to the task at hand. The inclusion of such features would not only introduce unwanted noise but also increase computational complexity. Furthermore, due to high non-linearity and dependency among a large number of features, DNN models tend to be unavoidably opaque and perceived as black-box methods because of their not well-understood internal functioning. A well-interpretable model can identify statistically significant features and explain the way they affect the model's outcome. In this paper, we propose an efficient method to improve the interpretability of black-box models for classification tasks in the case of high-dimensional datasets. To this end, we first train a black-box model on a high-dimensional dataset to learn the embeddings on which the classification is performed. To decompose the inner working principles of the black-box model and to identify top-k important features, we employ different probing and perturbing techniques. We then approximate the behavior of the black-box model by means of an interpretable surrogate model on the top-k feature space. Finally, we derive decision rules and local explanations from the surrogate model to explain individual decisions. Our approach outperforms and competes with state-of-the-art methods such as TabNet, XGboost, and SHAP-based interpretability techniques when tested on different datasets with varying dimensionality between 50 and 20,000.
翻译:深神经网络( DNNS) 显示在广泛的应用领域,由于在模拟复杂问题和处理高维数据集方面的有效性,DNN模型往往不易避免地不透明,并被视为黑箱方法。许多真实的数据集具有越来越高的维度,其中许多特征可能与手头的任务无关。加入这些特性不仅会引入不必要的噪音,而且会增加计算复杂性。此外,由于许多特性之间高度的非线性和依赖性,DNN模型往往在模型复杂问题和处理高维数据集方面的效力,因此在广泛的应用领域比传统的机器学习算法要强得多。一个良好的互换模型可以识别具有统计意义的特征,并解释它们如何影响模型的结果。在本文中,我们提出了一个有效的方法来改进黑箱模型在高维数据集方面对分类任务的解释性。我们首先将黑箱模型的模型模型和黑箱方法放在高端数据库中,然后将黑箱方法用于学习进行分类的嵌入式,然后用我们从黑箱的黑箱规则来解释。我们用一个重要的黑箱模型来解释,然后用不同的黑箱模型来解释,我们用来解释,然后从黑箱模型和黑箱方法来解释。