Published January 7, 2026 | Version v1.0.0
Model Open

UCD Cluster VM Management Model (ONNX)

  • 1. University College Dublin

Description

This is the first stable release of the DRL VM Management Agent, a Deep Reinforcement Learning model trained using Maskable PPO to optimize Virtual Machine placement and lifecycle management.
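Maskable PPO restricts the policy to actions that are valid in the current state, typically by excluding invalid actions before one is selected. The following is a minimal sketch of that masking step in plain Python. The function names are illustrative and are not part of the released inference pipeline.

```python
import math
from typing import List, Sequence


def masked_softmax(logits: Sequence[float], mask: Sequence[bool]) -> List[float]:
    """Softmax over the valid actions only; masked-out actions get probability 0."""
    if len(logits) != len(mask):
        raise ValueError("logits and mask must have the same length")
    if not any(mask):
        raise ValueError("at least one action must be valid")
    # Subtract the largest valid logit for numerical stability.
    peak = max(l for l, ok in zip(logits, mask) if ok)
    exps = [math.exp(l - peak) if ok else 0.0 for l, ok in zip(logits, mask)]
    total = sum(exps)
    return [e / total for e in exps]


def greedy_action(logits: Sequence[float], mask: Sequence[bool]) -> int:
    """Pick the most likely action among those the mask allows."""
    probs = masked_softmax(logits, mask)
    return max(range(len(probs)), key=probs.__getitem__)
```

In this sketch, an action with a high raw score is never chosen if its mask entry is False, which is the property that keeps the agent from proposing placements the infrastructure cannot accept.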

The model is exported in ONNX format for platform-independent deployment and comes with a full Python inference pipeline.

📦 What's Included

  • Pre-trained Model: vm_management_agent.onnx (Supports up to 32 Hosts).
  • Inference Engine: MLSysOpsVMManagementAgent class for easy integration.
  • State Serializer: Converts JSON infrastructure state into the 801-dimensional input vector the model requires.
  • Action Interpreter: Translates integer model outputs into human-readable decisions (e.g., "Create Large VM on Host 5").
  • Demo Suite: Includes demo.py and generate_scenarios.py to verify model behavior.
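To illustrate the roles of the State Serializer and Action Interpreter listed above, here is a rough sketch of both ideas: padding a flat feature list to the fixed 801-entry input, and decoding a flat action index into a readable decision. The feature layout, the action table, and the index encoding are all assumptions made for illustration. The README describes the actual formats used by MLSysOpsVMManagementAgent.

```python
from typing import List, Sequence

STATE_DIM = 801   # input size stated in the release notes
MAX_HOSTS = 32    # host limit stated in the release notes

# Hypothetical action kinds; the real agent's action table may differ.
ACTION_KINDS = ["Create Small VM", "Create Medium VM", "Create Large VM", "Destroy VM"]


def to_fixed_vector(features: Sequence[float], dim: int = STATE_DIM) -> List[float]:
    """Zero-pad a flat feature list to exactly `dim` entries."""
    if len(features) > dim:
        raise ValueError(f"{len(features)} features exceed the {dim}-dim input")
    return [float(x) for x in features] + [0.0] * (dim - len(features))


def decode_action(index: int, num_hosts: int = MAX_HOSTS) -> str:
    """Decode a flat action index into text.

    Assumed encoding (not taken from the release): index 0 is a no-op,
    and every other index is 1 + kind * num_hosts + host.
    """
    if index == 0:
        return "No-op"
    kind, host = divmod(index - 1, num_hosts)
    if not 0 <= kind < len(ACTION_KINDS):
        raise ValueError(f"action index {index} out of range")
    return f"{ACTION_KINDS[kind]} on Host {host}"
```

Under this assumed encoding, decode_action(70) yields "Create Large VM on Host 5". A fixed-size, zero-padded input is one common way to let a single network serve clusters smaller than the 32-host maximum.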

⚠️ Model Constraints (Model Card)

  • Scope: Single Datacenter.
  • Scale: Maximum 32 Hosts.
  • Resources: Optimization is based on CPU Cores only.
  • Job Limits: Maximum job size is 8 Cores.
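The constraints above can be checked before a scenario is handed to the model. This is a small sketch of such a guard. The function name and its arguments are hypothetical, and only the two numeric limits come from the model card.

```python
from typing import List

MAX_HOSTS = 32      # "Scale: Maximum 32 Hosts"
MAX_JOB_CORES = 8   # "Job Limits: Maximum job size is 8 Cores"


def check_request(num_hosts: int, job_cores: int) -> List[str]:
    """Return a list of constraint violations; an empty list means the request fits."""
    problems = []
    if not 1 <= num_hosts <= MAX_HOSTS:
        problems.append(f"host count {num_hosts} is outside 1..{MAX_HOSTS}")
    if not 1 <= job_cores <= MAX_JOB_CORES:
        problems.append(f"job size {job_cores} cores is outside 1..{MAX_JOB_CORES}")
    return problems
```

Returning every violation rather than stopping at the first makes it easier to report all problems with a scenario at once.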

🚀 Quick Start

  1. Clone and Install:
    git clone https://github.com/tgasla/MLSysOps-VM-Management-Agent.git
    cd MLSysOps-VM-Management-Agent
    pip install -r requirements.txt
    
  2. Run the Demo:
    python demo.py
    

Full Documentation: See the README.md for integration details.

Files

model-cluster-vm-management.zip (7.7 kB)
md5:3e08fa72db298398894bb9bdaecba62e
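
The MD5 checksum listed for model-cluster-vm-management.zip can be used to confirm that a download is intact. A minimal sketch using Python's standard hashlib module follows. The helper names are illustrative, and the local path of the archive is up to the user.

```python
import hashlib

# Checksum published on this record for model-cluster-vm-management.zip.
EXPECTED_MD5 = "3e08fa72db298398894bb9bdaecba62e"


def md5_of(path: str, chunk_size: int = 65536) -> str:
    """Compute the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path: str, expected: str = EXPECTED_MD5) -> bool:
    """Return True when the file's MD5 matches the expected digest."""
    return md5_of(path) == expected.lower()
```

For example, verify("model-cluster-vm-management.zip") should return True for an unmodified copy of the archive. MD5 is adequate for detecting accidental corruption, but it is not a safeguard against deliberate tampering.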

Additional details

Funding

European Commission
MLSysOps - Machine Learning for Autonomic System Operation in the Heterogeneous Edge-Cloud Continuum 101092912

Dates

Available
2026-01-06
Initial Public Release

Software

Repository URL
https://github.com/mlsysops-eu/model-cluster-vm-management
Programming language
Python
Development Status
Active