Having reliable specifications is an unavoidable challenge in achieving verifiable correctness, robustness, and interpretability of AI systems. Existing specifications for neural networks are in the paradigm of data as specification. That is, the local neighborhood centering around a reference input is considered to be correct (or robust). However, our empirical study shows that such a specification is extremely overfitted since usually no data points from the testing set lie in the certified region of the reference input, making them impractical for real-world applications. We propose a new family of specifications called neural representation as specification, which uses the intrinsic information of neural networks - neural activation patterns (NAP), rather than input data to specify the correctness and/or robustness of neural network predictions. We present a simple statistical approach to mining dominant neural activation patterns. We analyze NAPs from a statistical point of view and find that a single NAP can cover a large number of training and testing data points whereas ad hoc data-as-specification only covers the given reference data point. To show the effectiveness of discovered NAPs, we formally verify several important properties, such as various types of misclassifications will never happen for a given NAP, and there is no-ambiguity between different NAPs. We show that by using NAP, we can verify the prediction of the entire input space, while still recalling 84% of the data. Thus, we argue that using NAPs is a more reliable and extensible specification for neural network verification.
翻译:具备可靠的规格是实现可核实的正确性、稳健性和可解释性AI系统的一个不可避免的挑战。神经网络的现有规格是作为规格的数据范式中的现有规格。也就是说,以参考投入为中心的地方邻居被认为是正确的(或稳健的)。然而,我们的实证研究显示,这种规格是极其过分的,因为测试集的数据点通常没有在经认证的参考投入区域中进行,而临时数据具体化仅涵盖给定的参考数据点。为了显示所发现的国家行动方案的有效性,我们正式核实了一些重要的属性,例如神经网络的内在信息----神经激活模式(NAP),而不是输入数据,以具体说明神经网络预测的正确性和/或稳健性。我们为挖掘占主导地位的神经激活模式提出了一个简单的统计方法。我们从统计角度分析国家行动方案,发现单一的国家行动方案可以涵盖大量的培训和测试数据点,而临时数据具体化仅涵盖给定的参考数据点。我们正式核实了一些重要的属性,例如各种分类错误的内脏模型将永远不会出现,而使用84个国家行动方案的精确性,同时我们还要确认84个国家行动方案的精确性。