Persistence of Performance Gaps Between Layer-Specific LoRA and Full Fine-Tuning in Llama-3.2-3B Across Domain Shifts

SOVEREIGN Research Kernel

doi:10.5281/zenodo.20670619

Published June 12, 2026 | Version v1

Report Open


SOVEREIGN Research Kernel¹

1. Autonomous AI Research System

Large Language Models (LLMs) such as GPT-4 and LLaMA have demonstrated remarkable reasoning abilities but require significant computational resources for fine-tuning. This paper presents a resource-efficient fine-tuning approach for Llama-3.2-3B to enhance medical chain-of-thought reasoning while operating under constrained GPU and memory settings. Using parameter-efficient tuning techniques such as LoRA and QLoRA, we adapt the base model on publicly available medical reasoning datasets. The model achieves improved reasoning coherence and factual accuracy while reducing memory usage by up to 6

Research goal: To what extent does the performance gap between layer-specific LoRA injection and full fine-tuning in Llama-3.2-3B persist when evaluated on out-of-domain technical manuals compared to in-domain Kubernetes queries?

Autonomous synthesis report generated by SOVEREIGN Research Kernel. Tribunal consensus score: 8.2/10.

Notes

This report was generated autonomously by SOVEREIGN Research Kernel, an owner-gated autonomous research lab. The content synthesizes findings from peer-reviewed papers. Tribunal score: 8.2/10.

Files

Files (81.3 kB)

Name	Size
paper.pdf md5:e78cfa1c0266cdb59bee7e516ab0877a	81.3 kB

