Toward Optimizing Reinforcement Learning Workload Placement at the Cloud-Edge Continuum in 6G Networks: A Scaled RL Framework
Authors/Creators
Description
With the increasing deployment of Reinforcement Learning (RL) for network optimization at the edge of wireless
networks, the RL workload emerges as a significant challenge. While the placement of general Machine Learning workloads
across the cloud–edge continuum has been widely studied, existing solutions typically exclude RL techniques due to their
distinct structure and operational requirements. In this work, we propose a framework for RL workload placement in the
cloud–edge continuum, enabling the scaling of RL actor processes across both domains. In this framework, agents that interact with the environment through simple feedback loops are deployed at the edge, while training and model storage are performed in the cloud, where sufficient computational resources are available. We implement and simulate a prototype of one scaled RL actor that performs Quality-of-Service-aware resource block assignment with separate threads for environment interaction, inference, buffering/sampling, and the learning process. Finally, we outline the open challenges of the proposed framework.
Files
Towards Optimizing RL-final.pdf
Files
(655.1 kB)
| Name | Size | Download all |
|---|---|---|
|
md5:49675d1ccf7d596b17f6681cc402c922
|
655.1 kB | Preview Download |
Additional details
Dates
- Accepted
-
2026-05-25ICC2026