This paper summarizes eight design requirements for DNN testing criteria, taking into account distribution properties and practical concerns. We then propose a new criterion, NLC, that satisfies all of these design requirements. NLC treats a single DNN layer as the basic computational unit (rather than a single neuron) and captures four critical features of neuron output distributions. Thus, NLC is denoted as NeuraL Coverage, which more accurately describes how neural networks comprehend inputs via approximated distributions rather than neurons. We demonstrate that NLC is significantly correlated with the diversity of a test suite across a number of tasks (classification and generation) and data formats (image and text). Its capacity to discover DNN prediction errors is promising. Test input mutation guided by NLC result in a greater quality and diversity of exposed erroneous behaviors.
翻译:本文总结了DNN测试标准的八项设计要求,同时考虑到分布特性和实际问题。然后,我们提出了一个满足所有这些设计要求的新标准NLC。NLC将单一DNN层作为基本计算单位(而不是单一神经元),并捕捉神经输出分布的四个关键特征。因此,NLC被称作NeuraL覆盖,更准确地描述了神经网络如何通过大致分布而不是神经元来理解输入。我们证明NLC与一系列任务(分类和生成)和数据格式(图像和文本)的测试套件多样性密切相关。发现DNNN的预测错误的能力是大有希望的。NLC引导的试验输入突变导致暴露错误行为的更高质量和多样性。