Many studies have developed Machine Learning (ML) approaches to detect Software Vulnerabilities (SVs) in functions and the fine-grained code statements that cause such SVs. However, there is little work on leveraging such detection outputs for data-driven SV assessment to provide information about the exploitability, impact, and severity of SVs. This information is important for understanding SVs and prioritizing their fixing. Using large-scale data from 1,782 functions of 429 SVs in 200 real-world projects, we investigate ML models for automating function-level SV assessment tasks, i.e., predicting seven Common Vulnerability Scoring System (CVSS) metrics. We particularly study the value and use of vulnerable statements as inputs for developing the assessment models because SVs in functions originate from these statements. We show that vulnerable statements are 5.8 times smaller in size than non-vulnerable statements, yet exhibit 7.5-114.5% stronger assessment performance in terms of Matthews Correlation Coefficient (MCC). Incorporating the context of vulnerable statements further increases the performance by up to 8.9% (0.64 MCC and 0.75 F1-Score). Overall, we provide initial yet promising ML-based baselines for function-level SV assessment, paving the way for further research in this direction.
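To make the assessment task concrete, the following is a minimal sketch of the per-metric classification setup described above: a text-based classifier trained on vulnerable statements to predict one CVSS metric, evaluated with MCC and F1-Score. The feature extractor, classifier, and example statements are illustrative assumptions, not the models or data used in the study.

# Minimal illustrative sketch (assumed setup, not the paper's exact models):
# predict one CVSS metric (e.g., Severity) from the text of vulnerable statements.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import matthews_corrcoef, f1_score
from sklearn.pipeline import make_pipeline

# Hypothetical inputs: vulnerable statements (optionally with surrounding context),
# each labelled with a class of one CVSS metric.
train_stmts = [
    "strcpy(buf, user_input);",
    'query = "SELECT * FROM users WHERE id=" + uid;',
    "if (len > MAX) return -1;",
    "memcpy(dst, src, n);",
]
train_labels = ["HIGH", "HIGH", "LOW", "MEDIUM"]  # e.g., CVSS Severity classes

test_stmts = ["sprintf(out, fmt, data);", "n = min(n, MAX);"]
test_labels = ["HIGH", "LOW"]

# One classifier would be trained per CVSS metric; only a single metric is shown here.
model = make_pipeline(
    TfidfVectorizer(token_pattern=r"\S+"),  # treat code tokens as "words"
    LogisticRegression(max_iter=1000),
)
model.fit(train_stmts, train_labels)
preds = model.predict(test_stmts)

print("MCC:", matthews_corrcoef(test_labels, preds))
print("Macro F1:", f1_score(test_labels, preds, average="macro"))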