While rule-based attribution methods have proven useful for providing local explanations for Deep Neural Networks, explaining modern and more varied network architectures yields new challenges in generating trustworthy explanations, since the established rule sets might not be sufficient or applicable to novel network structures. As an elegant solution to the above issue, network canonization has recently been introduced. This procedure leverages the implementation-dependency of rule-based attributions and restructures a model into a functionally identical equivalent of alternative design to which established attribution rules can be applied. However, the idea of canonization and its usefulness have so far only been explored qualitatively. In this work, we quantitatively verify the beneficial effects of network canonization to rule-based attributions on VGG-16 and ResNet18 models with BatchNorm layers and thus extend the current best practices for obtaining reliable neural network explanations.
翻译:虽然事实证明,基于规则的归属方法有助于为深神经网络提供当地的解释,但解释现代和更加多样化的网络结构在产生值得信赖的解释方面产生了新的挑战,因为既定的成套规则可能不足以或不适用于新的网络结构,作为上述问题的一个优雅解决办法,最近采用了网络可移植办法,这一程序利用基于规则的归属的实施依赖性,并将一个模式重组为功能上与可适用既定归属规则的替代设计相同的模式,但是,迄今为止,仅仅从质量上探讨了集成的概念及其效用。在这项工作中,我们从数量上核实了网络化对VGG-16和ResNet18模型的基于规则的归属的有利影响,并用BatchNorm层来扩展目前获取可靠的神经网络解释的最佳做法。