In recent years, there has been significant work on increasing both interpretability and debuggability of a Deep Neural Network (DNN) by extracting a rule-based model that approximates its decision boundary. Nevertheless, current DNN rule extraction methods that consider a DNN's latent space when extracting rules, known as decompositional algorithms, are either restricted to single-layer DNNs or intractable as the size of the DNN or data grows. In this paper, we address these limitations by introducing ECLAIRE, a novel polynomial-time rule extraction algorithm capable of scaling to both large DNN architectures and large training datasets. We evaluate ECLAIRE on a wide variety of tasks, ranging from breast cancer prognosis to particle detection, and show that it consistently extracts more accurate and comprehensible rule sets than the current state-of-the-art methods while using orders of magnitude less computational resources. We make all of our methods available, including a rule set visualisation interface, through the open-source REMIX library (https://github.com/mateoespinosa/remix).
翻译:近年来,在提高深神经网络(DNN)可解释性和可调试性方面做了大量工作,方法是通过提取一个符合规则的模型,接近其决定界限;然而,目前的DNN规则提取方法,在提取规则时考虑DNN的潜在空间,称为分解算法,要么局限于单层DNN,要么随着DNN或数据的增长而难以解决这些限制;在本文件中,我们通过引入新的多时规则提取算法ECLAIRE来解决这些限制问题,ECLAIRE是一种新颖的超时规则提取算法,既可以推广到大型DNN结构和大型培训数据集。我们评估ECLAIRE的多种任务,从乳腺癌预测到粒子检测,并表明它不断提取比目前最先进的方法更准确、更易理解的规则,同时使用数量级的顺序更少的计算资源。我们通过开放源REMIX图书馆(https://github.com/metespinos/remix)提供我们所有方法,包括规则设置可视化界面。