Published August 23, 2022 | Version v1
Conference paper Open

Adaptable Butterfly Accelerator for Attention-based NNs via Hardware and Algorithm Co-design

  • 1. Imperial College London
  • 2. Samsung AI Center Cambridge
  • 3. University of Cambridge
  • 4. Samsung AI Center Cambridge and Imperial College London
  • 5. Samsung AI Cambridge and University of Cambridge
  • 6. Cornell University

Description

This repo contains the artifacts for our MICRO'22 paper titled "Adaptable Butterfly Accelerator for Attention-based NNs via Hardware and Algorithm Co-design".
The functionality of our hardware accelerator can be evaluated by running Verilog HDL designs and System Verilog testbenches on Vivado design suite.
The accuracy results can be obtained by running our PyTorch programs and the associated Bash scripts.
The power and resource utilization can be obtained by running Synthesis and Implementation using our RTL code and constraint files.
The latency can be obtained by running our custom Python-based performance model.
We also provide all our training log files and Vivado design reports in the link: https://drive.google.com/drive/folders/1zn38AjjQvqHZh-xsmeeIFK2BA-poIRAn?usp=sharing.

Files

NPU_Transformer.zip

Files (1.4 MB)

Name Size Download all
md5:6ff4b82ef8bc133cd7d8ff5da8531595
1.4 MB Preview Download