Published April 29, 2026 | Version v0.2.0
Software | Open

NaviMed-UMB: hardware envelope studies for local AI deployment on consumer RDNA 4 GPUs

Authors/Creators

  • 1. Department of Respiratory Physiopathology, Medical University of Białystok, Poland

Description

An engineering log and benchmark suite documenting the practical envelope of running modern large language models (up to 70B parameters) on a consumer-grade workstation with two AMD Radeon AI PRO R9700 GPUs (32 GB each) under ROCm 7.2 and vLLM 0.19. Version 0.1.0 documents the working configurations for Qwen 3.6 27B (released 2026-04-22) on this hardware, including a quantization-performance inversion: BF16 outpaces FP8 by approximately 75% under the current software stack, a result attributable to the absence of R9700-specific FP8 kernel configurations in vLLM. The intended audience includes researchers preparing local AI infrastructure for privacy-sensitive workloads, hardware reviewers seeking reproducible methodology, and software maintainers working on RDNA 4 support in inference frameworks.

Notes

If you use or reference this work, please cite it as below.

Files

kicrazom/navimed-umb-v0.2.0.zip

Files (11.5 MB)

Size: 11.5 MB
md5:bb6ceacc80c415679b3b848f0f61c6f2

Additional details

Related works