This work explores the signal awareness of AI models for source code understanding. Using a software vulnerability detection use case, we evaluate the models' ability to capture the correct vulnerability signals when producing their predictions. Our prediction-preserving input minimization (P2IM) approach systematically reduces the original source code to the minimal snippet a model needs to maintain its prediction. A model's reliance on incorrect signals is then uncovered when the vulnerability in the original code is missing from the minimal snippet, yet the model predicts both as vulnerable. We apply P2IM to three state-of-the-art neural network models across multiple datasets, and measure their signal awareness using a new metric we propose: Signal-aware Recall (SAR). The results show a sharp drop in the models' Recall, from the high 90s to the sub-60s under the new metric, suggesting that the models are presumably picking up substantial noise or dataset nuances while learning their vulnerability detection logic.
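To make the minimization step concrete, below is a minimal sketch of a P2IM-style reduction loop. It assumes a hypothetical model.predict(code) interface returning "vulnerable" or "safe" and line-level reduction granularity; the paper's actual implementation may use a different reduction strategy (e.g., delta debugging) or granularity.

    def p2im(code, model, target="vulnerable"):
        """Greedily drop source lines while the model's prediction is preserved."""
        lines = code.splitlines()
        changed = True
        while changed:
            changed = False
            for i in range(len(lines)):
                candidate = lines[:i] + lines[i + 1:]
                # Keep the deletion only if the target prediction survives.
                if model.predict("\n".join(candidate)) == target:
                    lines = candidate
                    changed = True
                    break
        # The result is 1-minimal with respect to single-line removal:
        # deleting any one remaining line would flip the model's prediction.
        return "\n".join(lines)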
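SAR can then be computed by crediting, among the vulnerable samples, only those true positives whose minimal snippet still contains the known vulnerable statements. The sketch below assumes a hypothetical sample format (code, is_vulnerable, vuln_lines) and reuses the p2im function above; the paper's exact bookkeeping may differ.

    def signal_aware_recall(samples, model):
        """Recall that credits only 'signal-aware' true positives."""
        tp_sa = 0      # true positives whose minimal snippet retains the vulnerability
        positives = 0  # all vulnerable samples (denominator: TP + FN)
        for code, is_vulnerable, vuln_lines in samples:
            if not is_vulnerable:
                continue
            positives += 1
            if model.predict(code) != "vulnerable":
                continue  # false negative
            snippet = p2im(code, model)
            if any(line in snippet for line in vuln_lines):
                tp_sa += 1  # the prediction rests on the actual vulnerability
        return tp_sa / positives if positives else 0.0

Under this accounting, a true positive whose minimization discards the vulnerable statements counts against SAR exactly like a miss, which is what drives Recall from the high 90s down to the sub-60s.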