Published June 6, 2026
| Version v1
Preprint
Open
Reducing Hallucinations in Domain-Specific LLMs via LoRA Fine-Tuning: A Production Case Study
Description
Large language models (LLMs) exhibit strong performance across natural
language tasks, yet their tendency to hallucinate remains a fundamental
barrier to deployment in domain-specific production settings. This paper
presents a production case study demonstrating that LoRA fine-tuning on
12,000 domain documents reduces hallucination rates by 22% relative to
Llama 3.1 8B while cutting monthly inference costs by 58% compared to
GPT-4. Combining LoRA with semantic paragraph-level RAG chunking yields
a 31% improvement in retrieval precision measured by RAGAS faithfulness
scores across 500+ evaluation queries.
Files
Reducing_Hallucinations_in_Domain_Specific_LLMs_via_LoRA_Fine_Tuning__A_Production_Case_Study.pdf
Files
(283.7 kB)
| Name | Size | Download all |
|---|---|---|
|
md5:4d638bc79346313b78b1090691b20ea0
|
283.7 kB | Preview Download |