Published November 8, 2024 | Version v2
Software Open

Official Implementation for "Helix: Serving Large Language Models over Heterogeneous GPUs and Network via Max-Flow"

Description

This artifact is the official open-source implementation for ASPLOS 2025 paper "Helix: Serving Large Language Models over Heterogeneous GPUs and Network via Max-Flow". It contains the simulator and the prototype system used in this paper. A detailed step-by-step guide to environment setup and example usage is in the readme file.

Files

Helix-ASPLOS25-v2.zip

Files (55.8 MB)

Name Size Download all
md5:1ce5fed02c5b6324b4ce38a17ecb5f28
55.8 MB Preview Download

Additional details

Related works

Is supplement to
Publication: arXiv:2406.01566 (arXiv)

Software

Repository URL
https://github.com/Thesys-lab/Helix-ASPLOS25
Programming language
Python, C++
Development Status
Active