Published December 11, 2025 | Version v1

Acceleration of C/C++ Kernels and ONNX Models on CGRAs with MLIR-Based Compilation

Description

Executing Artificial Intelligence (AI) at the edge is challenging due to tight energy and computational constraints. Heterogeneous platforms, particularly those incorporating Coarse-Grained Reconfigurable Arrays (CGRAs), offer a compelling trade-off between hardware specialization and programmability, supporting spatially distributed and energy-efficient computation. Despite this potential, the deployment of applications on CGRA accelerators remains limited by the lack of practical toolchains and methodologies. In this work, we propose an MLIR-based compilation flow that integrates both C/C++ kernels and ONNX-based AI models into a RISC-V system augmented with a CGRA accelerator. Our approach extracts the underlying Data Flow Graph (DFG) from the high-level representation and maps it onto the CGRA using an Integer Linear Programming (ILP) mapper that accounts for the accelerator's architectural constraints. A custom backend completes the toolchain by generating the binaries needed for coordinated execution across the RISC-V processor and the CGRA. This framework enables the practical deployment of heterogeneous edge workloads, combining the flexibility of software execution with the efficiency of hardware acceleration.
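
To illustrate the mapping step described above, the following is a minimal sketch of placing a small DFG onto a CGRA. It is not the authors' mapper: an exhaustive search stands in for the ILP solver, and the 2x2 array, the nearest-neighbour interconnect, the one-operation-per-PE rule, and all names (`DFG`, `PES`, `map_dfg`) are assumptions made purely for illustration. An ILP formulation would typically encode the same constraints with binary placement variables and minimise a similar cost.

```python
from itertools import permutations

# Hypothetical DFG for y = a*b + c*d: each node lists the nodes that consume its result.
DFG = {"mul0": ["add0"], "mul1": ["add0"], "add0": []}

# Assumed architecture: a 2x2 array of processing elements (PEs),
# each connected only to its horizontal and vertical neighbours.
ROWS, COLS = 2, 2
PES = [(r, c) for r in range(ROWS) for c in range(COLS)]


def manhattan(p, q):
    """Grid distance between two PEs given as (row, col) tuples."""
    return abs(p[0] - q[0]) + abs(p[1] - q[1])


def map_dfg(dfg, pes):
    """Return (cost, placement) with minimal total edge length, or None.

    Constraints (assumed): one operation per PE, and every producer must
    sit on a PE adjacent to its consumer.
    """
    nodes = list(dfg)
    edges = [(src, dst) for src, dsts in dfg.items() for dst in dsts]
    best = None
    # permutations() assigns distinct PEs, enforcing one operation per PE.
    for combo in permutations(pes, len(nodes)):
        placement = dict(zip(nodes, combo))
        dists = [manhattan(placement[s], placement[d]) for s, d in edges]
        if any(d > 1 for d in dists):
            continue  # violates the nearest-neighbour routing constraint
        cost = sum(dists)
        if best is None or cost < best[0]:
            best = (cost, placement)
    return best


if __name__ == "__main__":
    result = map_dfg(DFG, PES)
    if result is None:
        print("no feasible mapping")
    else:
        cost, placement = result
        print("total edge length:", cost)
        for node, pe in placement.items():
            print(f"  {node} -> PE{pe}")
```

Exhaustive search is only viable for toy graphs; it is used here because it needs no solver dependency while making the placement constraints explicit.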

Files

DCIS2025.pdf

Files (720.6 kB)

md5:da34561ba49ac40679db12c62e579842

Additional details

Funding

European Commission
A-IQ Ready - Artificial Intelligence using Quantum measured Information for realtime distributed systems at the edge (grant no. 101096658)