Published November 8, 2024 | Version v2
Software
Open
Official Implementation for "Helix: Serving Large Language Models over Heterogeneous GPUs and Network via Max-Flow"
Creators
Description
This artifact is the official open-source implementation for the ASPLOS 2025 paper "Helix: Serving Large Language Models over Heterogeneous GPUs and Network via Max-Flow". It contains the simulator and the prototype system used in the paper. The README file provides a detailed step-by-step guide to environment setup and example usage.
Files
| Name | Size | Checksum |
|---|---|---|
| Helix-ASPLOS25-v2.zip | 55.8 MB | md5:1ce5fed02c5b6324b4ce38a17ecb5f28 |
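The MD5 checksum listed for the archive can be used to confirm that a download is intact. The following is a minimal sketch, assuming the archive has been saved to the current working directory under its listed name; the helper `md5_of_file` and the constants are illustrative names and are not part of the artifact.

```python
import hashlib
from pathlib import Path

# Checksum and file name as listed in this record.
EXPECTED_MD5 = "1ce5fed02c5b6324b4ce38a17ecb5f28"
# Assumption: the archive was downloaded into the working directory.
ARCHIVE = Path("Helix-ASPLOS25-v2.zip")


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as f:
        # Read fixed-size chunks so the whole archive is never held in memory.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


if __name__ == "__main__":
    if ARCHIVE.exists():
        actual = md5_of_file(ARCHIVE)
        if actual == EXPECTED_MD5:
            print("Checksum OK")
        else:
            print(f"Checksum mismatch: got {actual}")
    else:
        print(f"{ARCHIVE} not found; download it first")
```

A mismatch usually indicates an incomplete or corrupted download, in which case fetching the archive again is the simplest remedy.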
Additional details
Related works
- Is supplement to
- Publication: arXiv:2406.01566 (arXiv)
Software
- Repository URL
- https://github.com/Thesys-lab/Helix-ASPLOS25
- Programming language
- Python, C++
- Development Status
- Active