Published July 21, 2025 | Version v1
Conference paper Open

Lightweight LLMs for 3GPP Specifications: Fine-Tuning, Retrieval-Augmented Generation and Quantization

  • 1. ROR icon Universidade Estadual de Campinas (UNICAMP)
  • 2. ROR icon Universidade Federal do Cariri
  • 3. NeuralMind
  • 4. University of Campinas

Description

Interpreting complex 3GPP telecommunications standards for question and answering (QA) poses a challenge for general-purpose LLMs due to their specialized terminology and high computational demands, limiting their use in resource- constrained environments. This work explores an efficient, open- source approach using the TeleQnA dataset of 10,000 telecom questions and the TSpec-LLM repository of processed 3GPP documents.

We enhance a lightweight Llama 3.2 (3B parameters) model, quantized from 16-bit precision to 4 bits, through fine- tuning and RAG to improve accuracy without heavy resource reliance.

Unlike prior resource-intensive or proprietary solutions, our method reduces memory demands, enabling deployment on modest hardware like edge devices or softwarized networks.

Shared via GitHub repositories [1], this approach advances cost- effective, reproducible AI for telecommunications QA, supporting contexts where budgets, computation, or public internet access are limited.

Files

RC3.C5.pdf

Files (330.6 kB)

Name Size Download all
md5:0c85f120182256b1805bbfe81427be00
330.6 kB Preview Download

Additional details

Funding

Fundação de Amparo à Pesquisa do Estado de São Paulo
SMART NEtworks and ServiceS for 2030 (SMARTNESS) 2021/00199-8