使用代表性学习的静态分析工具的排名警告 (Ranking Warnings of Static Analysis Tools Using Representation Learning)

Static analysis tools are frequently used to detect potential vulnerabilities in software systems. However, an inevitable problem of these tools is their large number of warnings with a high false positive rate, which consumes time and effort for investigating. In this paper, we present DeFP, a novel method for ranking static analysis warnings. Based on the intuition that warnings which have similar contexts tend to have similar labels (true positive or false positive), DeFP is built with two BiLSTM models to capture the patterns associated with the contexts of labeled warnings. After that, for a set of new warnings, DeFP can calculate and rank them according to their likelihoods to be true positives (i.e., actual vulnerabilities). Our experimental results on a dataset of 10 real-world projects show that using DeFP, by investigating only 60% of the warnings, developers can find +90% of actual vulnerabilities. Moreover, DeFP improves the state-of-the-art approach 30% in both Precision and Recall.

翻译：经常使用静态分析工具来发现软件系统中的潜在脆弱性。然而,这些工具的一个不可避免的问题是,它们使用大量假正率高的警告,这花费了时间和调查努力。我们在本文件中介绍了DeFP,这是排列静态分析警告的新方法。根据类似情况下的警告往往有相似标签的直觉(真正的正或假正),DeFP是用两个BILSTM模型构建的,以捕捉与标签警告相关的模式。之后,对于一套新警告,DeFP可以根据其真实的正率(即实际脆弱性)来计算和排序。我们在10个真实世界项目数据集上的实验结果显示,通过对60%的警告进行调查,开发者可以找到实际脆弱性的+90%。此外,DeFP在精密度和回记中改进了最先进的30%方法。

相关内容

TOOLS

关注 1

这个新版本的工具会议系列恢复了从1989年到2012年的50个会议的传统。工具最初是“面向对象语言和系统的技术”，后来发展到包括软件技术的所有创新方面。今天许多最重要的软件概念都是在这里首次引入的。2019年TOOLS 50+1在俄罗斯喀山附近举行，以同样的创新精神、对所有与软件相关的事物的热情、科学稳健性和行业适用性的结合以及欢迎该领域所有趋势和社区的开放态度，延续了该系列。官网链接：http://tools2019.innopolis.ru/

深度学习搜索，Exploring Deep Learning for Search

专知会员服务

61+阅读 · 2020年5月9日

【ICLR2020-Facebook 2020】深度学习符号化数学，Deep Learning for Symbolic Mathematics，

专知会员服务

23+阅读 · 2020年4月7日

【ACL2020-Facebook AI】跨语言表示学习，Unsupervised Cross-lingual Representation Learning at Scale

专知会员服务

27+阅读 · 2020年4月5日

【ACL2020-Facebook AI】大规模无监督跨语言表示学习

专知会员服务

34+阅读 · 2020年4月5日