Published August 25, 2023 | Version v1
Conference paper Open

GEMM-Like Convolution for Deep Learning Inference on the Xilinx Versal

  • 1. Universitat Politècnica de València
  • 2. Universidad de Córdoba

Description

We revisit a blocked formulation of the direct convolution algorithm that mimics modern realizations of the general matrix multiplication (GEMM), demonstrating that the same approach can be adapted to deliver high performance for deep learning inference tasks on the AI Engine (AIE) tile embedded in Xilinx Versal platforms. Our experimental results on a Xilinx Versal VCK190 shows an arithmetic throughput close to 70% of the theoretical peak of the AIE tile for 8-bit integer operands and the convolutional layers arising in ResNet-50 v.15+ImageNet.

Files

2023_H3_Convolution_on_Versal_cameraReady_arxiv.pdf

Files (304.5 kB)

Additional details

Funding

eFlows4HPC – Enabling dynamic and Intelligent workflows in the future EuroHPCecosystem 955558
European Commission
APROPOS – Approximate Computing for Power and Energy Optimisation 956090
European Commission