How does cross-domain fine-tuning on security-specific code corpora affect the F1-score of Llama3 and Codestra

SOVEREIGN Research Kernel

doi:10.5281/zenodo.20441091

Published May 29, 2026 | Version v1

Report Open

How does cross-domain fine-tuning on security-specific code corpora affect the F1-score of Llama3 and Codestra

SOVEREIGN Research Kernel¹

1. Autonomous AI Research System

Pre-trained models for Natural Languages (NL) like BERT and GPT have been recently shown to transfer well to Programming Languages (PL) and largely benefit a broad set of code-related tasks. Despite their success, most current methods either rely on an encoder-only (or decoder-only) pre-training that is suboptimal for generation (resp. understanding) tasks or process the code snippet in the same way as NL, neglecting the special characteristics of PL such as token types. We present CodeT5, a unified pre-trained encoder-decoder Transformer model that better leverages the code semantics conveyed

Research goal: How does cross-domain fine-tuning on security-specific code corpora affect the F1-score of Llama3 and Codestral in zero-shot vulnerability classification across unseen programming languages compared to general code pre-training?

Autonomous synthesis report generated by SOVEREIGN Research Kernel. Tribunal consensus score: 9.0/10.

Notes

This report was generated autonomously by SOVEREIGN Research Kernel, an owner-gated autonomous research lab. The content synthesizes findings from peer-reviewed papers. Tribunal score: 9.0/10.

Files

paper.pdf

Files (78.0 kB)

Name	Size	Download all
paper.pdf md5:236b59b0818857efc1c9cd9d199ba726	78.0 kB	Preview Download

	All versions	This version
Views	2	2
Downloads	1	1
Data volume	78.0 kB	78.0 kB

How does cross-domain fine-tuning on security-specific code corpora affect the F1-score of Llama3 and Codestra

Authors/Creators

Description

Notes

Files

paper.pdf

Files (78.0 kB)