Lightweight LLMs for 3GPP Specifications: Fine-Tuning, Retrieval-Augmented Generation and Quantization
Authors/Creators
Description
Interpreting complex 3GPP telecommunications standards for question and answering (QA) poses a challenge for general-purpose LLMs due to their specialized terminology and high computational demands, limiting their use in resource- constrained environments. This work explores an efficient, open- source approach using the TeleQnA dataset of 10,000 telecom questions and the TSpec-LLM repository of processed 3GPP documents.
We enhance a lightweight Llama 3.2 (3B parameters) model, quantized from 16-bit precision to 4 bits, through fine- tuning and RAG to improve accuracy without heavy resource reliance.
Unlike prior resource-intensive or proprietary solutions, our method reduces memory demands, enabling deployment on modest hardware like edge devices or softwarized networks.
Shared via GitHub repositories [1], this approach advances cost- effective, reproducible AI for telecommunications QA, supporting contexts where budgets, computation, or public internet access are limited.
Files
RC3.C5.pdf
Files
(330.6 kB)
| Name | Size | Download all |
|---|---|---|
|
md5:0c85f120182256b1805bbfe81427be00
|
330.6 kB | Preview Download |
Additional details
Funding
- Fundação de Amparo à Pesquisa do Estado de São Paulo
- SMART NEtworks and ServiceS for 2030 (SMARTNESS) 2021/00199-8