A spectral approximation of a Boolean function is proposed for approximating the decision boundary of an ensemble of Deep Neural Networks (DNNs) solving two-class pattern recognition problems. The Walsh combination of relatively weak DNN classifiers is shown experimentally to be capable of detecting adversarial attacks. By observing the difference in Walsh coefficient approximation between clean and adversarial images, it appears that transferability of attack may be used for detection. Approximating the decision boundary may also aid in understanding the learning and transferability properties of DNNs. While the experiments here use images, the proposed approach of modelling two-class ensemble decision boundaries could in principle be applied to any application area. Code for this paper implementing Walsh Coefficient Examples of approximating artificial Boolean functions can be found at https://doi.org/10.24433/CO.3695905.v1
翻译:提出布尔函数的光谱近似值,以接近深神经网络联合体(DNN)解决双级模式识别问题的决定边界。相对弱的DNN分类器的沃尔什组合实验性地显示,相对弱的DNN分类器能够发现对抗性攻击。通过观察清洁图像和对抗性图像之间沃尔什系数近似值的差异,似乎可以使用攻击的可转移性来探测。运用决定边界也有助于了解DNN的学习和可转移性。虽然此处的实验使用图像,但拟议的两级共同决定界限建模方法原则上可以适用于任何应用领域。本文的代码在https://doi.org/10.24433/CO.3695905.v1中可以找到落实人造布尔功能的低效系数示例。