Published July 6, 2021 | Version v1
Poster Open

CPU and FPGA performance comparison of a conjugate gradient solver extracted from a molecular dynamics code

  • 1. French National Centre for Scientific Research
  • 2. Maxeler Technologies UK

Description

FPGA devices used in the HPC context promise an increased energy efficiency, enhancing the computing systems Flop/W rate. This work compares an FPGA and a CPU implementation of a conjugate gradient solver in terms of both time to solution and energy to solution metrics. The starting point is MetalWalls, a molecular dynamics code developed at Sorbonne University in Pr. M. Salanne's team, capable of computing accurately the charge and discharge cycles of supercapacitors (energy storing devices). In the context of the H2020 EXA2PRO project, a miniapp has been derived from the F90 pure MPI production code, extracting the core of the electrostatic computation. The FPGA version has been implemented with the Data Flow Engine (DFE) software toolchain developed by Maxeler. Additionally, since FPGAs can perform arithmetic operations with any number of bits instead of the "standard" 32 or 64 bits, the miniapp could be further accelerated using optimised custom number formats. Thanks to an accuracy analysis based on the CADNA tool and comparisons with quadruple precision runs, this acceleration could be achieved without decreasing the computed solution accuracy. Finally, the original CPU and the developed FPGA implementations could be compared on Juelich Computing Centre computing systems.

Files

Poster_PASC2021.pdf

Files (5.9 MB)

Name Size Download all
md5:05a3db757a8d43fe737719146ea6b816
5.9 MB Preview Download

Additional details

Funding

EXA2PRO – Enhancing Programmability and boosting Performance Portability for Exascale Computing Systems 801015
European Commission