ChatGPT模型在漏洞检测中的评估 (Evaluation of ChatGPT Model for Vulnerability Detection) - 专知论文

会员服务 ·

0

漏洞检测 · ChatGPT · 多标签分类 · 代码 · 分类器 ·

2023 年 4 月 12 日

Evaluation of ChatGPT Model for Vulnerability Detection

翻译：ChatGPT模型在漏洞检测中的评估

Anton Cheshkov,Pavel Zadorozhny,Rodion Levichev

In this technical report, we evaluated the performance of the ChatGPT and GPT-3 models for the task of vulnerability detection in code. Our evaluation was conducted on our real-world dataset, using binary and multi-label classification tasks on CWE vulnerabilities. We decided to evaluate the model because it has shown good performance on other code-based tasks, such as solving programming challenges and understanding code at a high level. However, we found that the ChatGPT model performed no better than a dummy classifier for both binary and multi-label classification tasks for code vulnerability detection.

翻译：在本技术报告中，我们针对漏洞检测任务，使用二进制和多标签分类任务在基于现实世界数据集上对ChatGPT和GPT-3模型的性能进行了评估。我们决定评估该模型是因为在其他基于代码的任务上已经显示出良好的性能，例如解决编程挑战和高级理解代码。然而，我们发现ChatGPT模型在二进制和多标签分类任务上的表现均不如虚拟分类器的性能。

0

相关内容

漏洞检测

百篇论文纵览大型语言模型最新研究进展

百篇论文纵览大型语言模型最新研究进展

专知会员服务

70+阅读 · 2023年3月31日

最新《Transformers模型》教程，64页ppt

最新《Transformers模型》教程，64页ppt

专知会员服务

325+阅读 · 2020年11月26日

【ACL2020-浙大-微软】多轮对话推理数据集，MuTual: A Dataset for Multi-Turn Dialogue Reasoning

【ACL2020-浙大-微软】多轮对话推理数据集，MuTual: A Dataset for Multi-Turn Dialogue Reasoning

专知会员服务

37+阅读 · 2020年4月10日

【微软雷德蒙研究院】小样本自然语言生成，Few-shot Natural Language Generation for Task-Oriented Dialog

【微软雷德蒙研究院】小样本自然语言生成，Few-shot Natural Language Generation for Task-Oriented Dialog

专知会员服务

33+阅读 · 2020年2月29日

【O’Reilly讲座】基于深度学习的异常检测方法用于检测大型数据集的质量：Anomaly detection using deep learning to measure the quality of large datasets

【O’Reilly讲座】基于深度学习的异常检测方法用于检测大型数据集的质量：Anomaly detection using deep learning to measure the quality of large datasets

专知会员服务

31+阅读 · 2020年1月11日

GNN 新基准！Long Range Graph Benchmark

GNN 新基准！Long Range Graph Benchmark

图与推荐

0+阅读 · 2022年10月18日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【推荐】深度学习目标检测概览

【推荐】深度学习目标检测概览

机器学习研究会

10+阅读 · 2017年9月1日

钙钛矿型太阳能电池稳定性的机理研究及其光伏器件性能提高

国家自然科学基金

0+阅读 · 2015年12月31日

DSSCs阳极散射层用Magneli相TiOx反opals制备及电池高效光电转换机理

国家自然科学基金

0+阅读 · 2014年12月31日

地下水流数值模拟概念模型的不确定性分析

国家自然科学基金

0+阅读 · 2013年12月31日

X射线干涉光刻和谱学方法研制金属等离子太阳能电池

国家自然科学基金

0+阅读 · 2012年12月31日

动力锂离子电池正极材料Li1-xMyVOPO4/C的制备及性能

国家自然科学基金

0+阅读 · 2011年12月31日

Neuron to Graph: Interpreting Language Model Neurons at Scale

Arxiv

0+阅读 · 2023年5月31日

COVID-19 Detection from Mass Spectra of Exhaled Breath

Arxiv

0+阅读 · 2023年5月30日

How Effective Are Neural Networks for Fixing Security Vulnerabilities

Arxiv

0+阅读 · 2023年5月29日

Few-shot Natural Language Generation for Task-Oriented Dialog

Few-shot Natural Language Generation for Task-Oriented Dialog

Arxiv

30+阅读 · 2020年2月27日

LayoutLM: Pre-training of Text and Layout for Document Image Understanding

LayoutLM: Pre-training of Text and Layout for Document Image Understanding

Arxiv

12+阅读 · 2020年2月19日

VIP会员

文章信息

相关主题

多标签分类

相关VIP内容

百篇论文纵览大型语言模型最新研究进展

百篇论文纵览大型语言模型最新研究进展

专知会员服务

70+阅读 · 2023年3月31日

最新《Transformers模型》教程，64页ppt

最新《Transformers模型》教程，64页ppt

专知会员服务

325+阅读 · 2020年11月26日

【ACL2020-浙大-微软】多轮对话推理数据集，MuTual: A Dataset for Multi-Turn Dialogue Reasoning

【ACL2020-浙大-微软】多轮对话推理数据集，MuTual: A Dataset for Multi-Turn Dialogue Reasoning

专知会员服务

37+阅读 · 2020年4月10日

【微软雷德蒙研究院】小样本自然语言生成，Few-shot Natural Language Generation for Task-Oriented Dialog

【微软雷德蒙研究院】小样本自然语言生成，Few-shot Natural Language Generation for Task-Oriented Dialog

专知会员服务

33+阅读 · 2020年2月29日

【O’Reilly讲座】基于深度学习的异常检测方法用于检测大型数据集的质量：Anomaly detection using deep learning to measure the quality of large datasets

【O’Reilly讲座】基于深度学习的异常检测方法用于检测大型数据集的质量：Anomaly detection using deep learning to measure the quality of large datasets

专知会员服务

31+阅读 · 2020年1月11日

热门VIP内容

开通专知VIP会员享更多权益服务

前沿人工智能趋势报告（Frontier AI Trends Report）

【AAAI2026】善始则事半功倍：基于前缀优化的大语言模型推理强化学习

Andrej Karpathy：2025 年 LLM 年度回顾（2025 LLM Year in Review）

音退化问题：基于输入操控的鲁棒语音转换综述

相关资讯

GNN 新基准！Long Range Graph Benchmark

GNN 新基准！Long Range Graph Benchmark

图与推荐

0+阅读 · 2022年10月18日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【推荐】深度学习目标检测概览

【推荐】深度学习目标检测概览

机器学习研究会

10+阅读 · 2017年9月1日

相关论文

Neuron to Graph: Interpreting Language Model Neurons at Scale

Arxiv

0+阅读 · 2023年5月31日

COVID-19 Detection from Mass Spectra of Exhaled Breath

Arxiv

0+阅读 · 2023年5月30日

How Effective Are Neural Networks for Fixing Security Vulnerabilities

Arxiv

0+阅读 · 2023年5月29日

Few-shot Natural Language Generation for Task-Oriented Dialog

Few-shot Natural Language Generation for Task-Oriented Dialog

Arxiv

30+阅读 · 2020年2月27日

LayoutLM: Pre-training of Text and Layout for Document Image Understanding

LayoutLM: Pre-training of Text and Layout for Document Image Understanding

Arxiv

12+阅读 · 2020年2月19日

相关基金

钙钛矿型太阳能电池稳定性的机理研究及其光伏器件性能提高

国家自然科学基金

0+阅读 · 2015年12月31日

DSSCs阳极散射层用Magneli相TiOx反opals制备及电池高效光电转换机理

国家自然科学基金

0+阅读 · 2014年12月31日

地下水流数值模拟概念模型的不确定性分析

国家自然科学基金

0+阅读 · 2013年12月31日

X射线干涉光刻和谱学方法研制金属等离子太阳能电池

国家自然科学基金

0+阅读 · 2012年12月31日

动力锂离子电池正极材料Li1-xMyVOPO4/C的制备及性能

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员