查找压缩 DNN 图像分类模型的偏离行为 (Finding Deviated Behaviors of the Compressed DNN Models for Image Classifications)

from arxiv, This is the author version. The DOI of the published version is http://dx.doi.org/10.1145/3583564. Please see the full abstract in the pdf

Model compression can significantly reduce the sizes of deep neural network (DNN) models, and thus facilitates the dissemination of sophisticated, sizable DNN models, especially for their deployment on mobile or embedded devices. However, the prediction results of compressed models may deviate from those of their original models. To help developers thoroughly understand the impact of model compression, it is essential to test these models to find those deviated behaviors before dissemination. However, this is a non-trivial task because the architectures and gradients of compressed models are usually not available. To this end, we propose DFLARE, a novel, search-based, black-box testing technique to automatically find triggering inputs that result in deviated behaviors in image classification tasks. DFLARE iteratively applies a series of mutation operations to a given seed image, until a triggering input is found. For better efficacy and efficiency, DFLARE models the search problem as Markov Chains and leverages the Metropolis-Hasting algorithm to guide the selection of mutation operators in each iteration. Further, DFLARE utilizes a novel fitness function to prioritize the mutated inputs that either cause large differences between two models' outputs, or trigger previously unobserved models' probability vectors. We evaluated DFLARE on 21 compressed models for image classification tasks with three datasets. The results show that DFLARE outperforms the baseline in terms of efficacy and efficiency. We also demonstrated that the triggering inputs found by DFLARE can be used to repair up to 48.48% deviated behaviors in image classification tasks and further decrease the effectiveness of DFLARE on the repaired models.

翻译：模型压缩可以大大缩小深神经网络(DNN)模型的大小,从而有利于传播复杂、可扩展的DNN模型,特别是用于移动或嵌入装置的模型。然而,压缩模型的预测结果可能与原始模型的预测结果不同。为了帮助开发者彻底理解模型压缩的影响,必须测试这些模型,以便在传播之前找到这些偏离的行为。然而,这是一项非边际任务,因为压缩模型的架构和梯度通常不可用。为此,我们提议DFLARE(一种新型的、基于搜索的、黑箱测试技术),以自动发现触发输入,从而导致图像分类任务中出现偏差的行为。DFLARE(D)将一系列突变操作操作应用给某个种子图像,直到找到触发输入。为了提高效能和效率,DFLARE(Markov 链)和Teopolis-Hasting 算法算法(Metopolis-Hasting 算法)来指导每次循环中的变异操作者的选择。此外,DFLARE(DARE)利用一种非新型的变更精确的计算功能,在图像模型上,我们用三级的变动的变动的变动的变动的变动的变压模型可以显示前的变动的变压变压变压变压变压变压的变动的变压的变压变压变压的变压的变动的DRLALALALALALLLLLLLLLLLLLLLLLLLL) 。