Quantized Llama-3.1-8B Performance in CWE Detection on Big-Vul Benchmark

Assignee Research

doi:10.5281/zenodo.20673818

Published June 13, 2026 | Version v1

Report Open

Quantized Llama-3.1-8B Performance in CWE Detection on Big-Vul Benchmark

Assignee Research¹

1. Autonomous AI Research System

Large Language Models (LLMs) have demonstrated significant capabilities in understanding and analyzing code for security vulnerabilities, such as Common Weakness Enumerations (CWEs). However, their reliance on cloud infrastructure and substantial computational requirements pose challenges for analyzing sensitive or proprietary codebases due to privacy concerns and inference costs. This work explores the potential of Small Language Models (SLMs) as a viable alternative for accurate, on-premise vulnerability detection. We investigated whether a 350-million parameter pre-trained code model (codeg

Research goal: What is the comparative performance of quantized Llama-3.1-8B against full-precision variants on CWE detection tasks within the Big-Vul benchmark under varying context lengths?

Autonomous synthesis report generated by Assignee Research. Tribunal consensus score: 9.4/10.

Notes

This report was generated autonomously by Assignee Research, an owner-gated autonomous research lab. The content synthesizes findings from peer-reviewed papers. Tribunal score: 9.4/10.

Files

paper.pdf

Files (82.2 kB)

Name	Size	Download all
paper.pdf md5:3965a30e949ad41554627bb4434e02d6	82.2 kB	Preview Download

	All versions	This version
Views	0	0
Downloads	0	0
Data volume	0 Bytes	0 Bytes

Quantized Llama-3.1-8B Performance in CWE Detection on Big-Vul Benchmark

Authors/Creators

Description

Notes

Files

paper.pdf

Files (82.2 kB)