How Far Have We Gone in Vulnerability Detection Using Large Language Models

Gao, Zeyu; Wang, Hao; Zhou, Yuchen; Zhu, Wenyu; Zhang, Chao

Computer Science > Artificial Intelligence

arXiv:2311.12420 (cs)

[Submitted on 21 Nov 2023 (v1), last revised 22 Dec 2023 (this version, v3)]

Title:How Far Have We Gone in Vulnerability Detection Using Large Language Models

Authors:Zeyu Gao, Hao Wang, Yuchen Zhou, Wenyu Zhu, Chao Zhang

View PDF HTML (experimental)

Abstract:As software becomes increasingly complex and prone to vulnerabilities, automated vulnerability detection is critically important, yet challenging. Given the significant successes of large language models (LLMs) in various tasks, there is growing anticipation of their efficacy in vulnerability detection. However, a quantitative understanding of their potential in vulnerability detection is still missing. To bridge this gap, we introduce a comprehensive vulnerability benchmark VulBench. This benchmark aggregates high-quality data from a wide range of CTF (Capture-the-Flag) challenges and real-world applications, with annotations for each vulnerable function detailing the vulnerability type and its root cause. Through our experiments encompassing 16 LLMs and 6 state-of-the-art (SOTA) deep learning-based models and static analyzers, we find that several LLMs outperform traditional deep learning approaches in vulnerability detection, revealing an untapped potential in LLMs. This work contributes to the understanding and utilization of LLMs for enhanced software security.

Subjects:	Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Cryptography and Security (cs.CR)
Cite as:	arXiv:2311.12420 [cs.AI]
	(or arXiv:2311.12420v3 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2311.12420

Submission history

From: Hao Wang [view email]
[v1] Tue, 21 Nov 2023 08:20:39 UTC (2,468 KB)
[v2] Wed, 20 Dec 2023 15:48:15 UTC (2,469 KB)
[v3] Fri, 22 Dec 2023 14:07:16 UTC (2,469 KB)

Computer Science > Artificial Intelligence

Title:How Far Have We Gone in Vulnerability Detection Using Large Language Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:How Far Have We Gone in Vulnerability Detection Using Large Language Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators