Multimodal Fine-Tuned Small Language Models vs. Large Multimodal LLMs in CWE Detection Accuracy
Description
This report synthesises findings from 7 peer-reviewed papers addressing the following research question: How do small language models (SLMs) fine-tuned with multimodal context compare to larger multimodal LLMs in terms of CWE detection accuracy and alignment metrics on the extended Big-Vul dataset. In this paper, we present a challenging code reasoning task: vulnerability detection. Large Language Models (LLMs) have shown promising results in natural-language and math reasoning, but state-of-the-art (SOTA) models reported only 54.5\% Balanced Accuracy in our vulnerability. 6 claims were extracted from source literature; 6 were independently verified against retrieved documents. An automated multi-reviewer quality assessment produced a score of 8.3/10. This report is a machine-generated literature synthesis and does not constitute original research.
Research goal: How do small language models (SLMs) fine-tuned with multimodal context compare to larger multimodal LLMs in terms of CWE detection accuracy and alignment metrics on the extended Big-Vul dataset?
Autonomous literature synthesis. Automated review score: 8.3/10. Full text and citation available at Assignee Research.
Notes
Files
paper.pdf
Files
(81.5 kB)
| Name | Size | Download all |
|---|---|---|
|
md5:0dd5c49055c932cfda06ff2e73b9e34e
|
81.5 kB | Preview Download |
Additional details
Related works
- Is compiled by
- https://assignee.net (URL)