Published February 21, 2024 | Version v1
Preprint Open

YinYangRAN: Resource Multiplexing in GPU-Accelerated Virtualized RANs

  • 1. Universidad Carlos III de Madrid
  • 2. ROR icon IMDEA Networks
  • 3. NEC Laboratories Europe GmbH
  • 4. i2CAT Foundation
  • 5. ICREA

Description

RAN virtualization is revolutionizing the telco industry, enabling 5G Distributed Units to run using general-purpose platforms equipped with Hardware Accelerators (HAs). Recently, GPUs have been proposed as HAs, hinging on their unique capability to execute 5G PHY operations efficiently while also processing Machine Learning (ML) workloads. While this ambivalence makes GPUs attractive for cost-effective deployments, we experimentally demonstrate that multiplexing 5G and ML workloads in GPUs is in fact challenging, and that using conventional GPU-sharing methods can severely disrupt 5G operations. We then introduce YinYangRAN, an innovative O-RAN-compliant solution that supervises GPU-based HAs so as to ensure reliability in the 5G processing pipeline while maximizing the throughput of concurrent ML services. YinYangRAN performs GPU resource allocation decisions via a computationally-efficient approximate dynamic programming technique, which is informed by a neural network trained on real-world measurements. Using workloads collected in real RANs, we demonstrate that YinYangRAN can achieve over 50% higher 5G processing reliability than conventional GPU sharing models with minimal impact on co-located ML workloads. To our knowledge, this is the first work identifying and addressing the complex problem of HA management in emerging GPU-accelerated vRANs, and represents a promising step towards multiplexing PHY and ML workloads in mobile networks.

Files

m38214-lo_schiavo final.pdf

Files (571.7 kB)

Name Size Download all
md5:475d8270f80595c5a6219b697f811fa3
571.7 kB Preview Download

Additional details

Funding

DAEMON – Network intelligence for aDAptive and sElf-Learning MObile Networks 101017109
European Commission
BeGREEN – Beyond 5G Artificial Intelligence Assisted Energy Efficient Open Radio Access Network 101097083
European Commission
ORIGAMI – Optimized resource integration and global architecture for mobile infrastructure for 6G 101139270
European Commission