Published April 29, 2026
| Version v0.2.0
Software
Open
NaviMed-UMB: hardware envelope studies for local AI deployment on consumer RDNA 4 GPUs
Authors/Creators
- 1. Department of Respiratory Physiopathology, Medical University of Białystok, Poland
Description
An engineering log and benchmark suite documenting the practical envelope of running modern large language models (up to 70B parameters) on a consumer-grade dual AMD Radeon AI PRO R9700 32 GB workstation under ROCm 7.2 and vLLM 0.19. Version 0.1.0 documents the working configurations for Qwen 3.6 27B (released 2026-04-22) on this hardware, including a quantization-performance inversion finding (BF16 outpaces FP8 by approximately 75% under the current software stack) attributable to the absence of R9700-specific FP8 kernel configurations in vLLM. Intended audience includes researchers preparing local AI infrastructure for privacy-sensitive workloads, hardware reviewers seeking reproducible methodology, and software maintainers working on RDNA 4 support in inference frameworks.
Notes
Files
kicrazom/navimed-umb-v0.2.0.zip
Files
(11.5 MB)
| Name | Size | Download all |
|---|---|---|
|
md5:bb6ceacc80c415679b3b848f0f61c6f2
|
11.5 MB | Preview Download |
Additional details
Related works
- Is supplement to
- Software: https://github.com/kicrazom/navimed-umb/tree/v0.2.0 (URL)
Software
- Repository URL
- https://github.com/kicrazom/navimed-umb