A Dynamic Allocation Scheme for Adaptive Shared-Memory Mapping on Kilo-core RV Clusters for Attention-Based Model Deployment

Bowen Wang; Marco Bertuletti; Yichao Zhang; Victor J.B. Jung; Luca Benini

doi:10.5281/zenodo.17609103

Published November 14, 2025 | Version v1

Conference paper Open

A Dynamic Allocation Scheme for Adaptive Shared-Memory Mapping on Kilo-core RV Clusters for Attention-Based Model Deployment

Attention-based models demand flexible hardware to manage diverse kernels with varying arithmetic intensities and memory access patterns. Large clusters with shared L1 memory, a commonarchitectural pattern, struggle to fully utilize their processing elements (PEs) when scaled up due to reduced throughput in the hierarchical PE-to-L1 intra-cluster interconnect. This paper presents Dynamic Allocation Scheme (DAS), a runtime programmable address remapping hardware unit coupled with a unified memory allocator, designed to minimize data access contention of PEs onto the multi-banked L1. We evaluated DAS on an aggressively scaled-up 1024-PE RISC-V cluster with Non-Uniform Memory Access (NUMA) PE-to-L1 interconnect to demonstrate its potential for improving data locality in large parallel machine learning workloads. For a Vision Transformer (ViT)-L/16 model, each encoder layer executes in 5.67ms, achieving a 1.94× speedup over the fixed word-level interleaved baseline with 0.81 PE utilization. Implemented in 12nm FinFET technology, DAS incurs <0.1% area overhead.

Files

A Dynamic Allocation Scheme for Adaptive.pdf

Files (2.8 MB)

Name	Size	Download all
A Dynamic Allocation Scheme for Adaptive.pdf md5:3aa4c3198d267d0effa88fac9bfd9f9e	2.8 MB	Preview Download

Additional details

European Commission
COREnext - European Core Technologies for Next Generation Communication-Computing Hardware 101092598

	All versions	This version
Views	79	79
Downloads	21	21
Data volume	70.1 MB	70.1 MB

A Dynamic Allocation Scheme for Adaptive Shared-Memory Mapping on Kilo-core RV Clusters for Attention-Based Model Deployment

Authors/Creators

Description

Files

A Dynamic Allocation Scheme for Adaptive.pdf

Files (2.8 MB)

Additional details

Funding