Published September 2, 2012 | Version v1
Working paper Open

An Analysis of FFT Performance in PRACE Application Codes

Authors/Creators

  • 1. STFC Daresbury Laboratory, Warrington, UK
  • 1. STFC Daresbury Laboratory, Warrington, UK
  • 2. Institute of Physics, Belgrade, Serbia
  • 3. Irish Centre for High-End Computing, Dublin, Ireland

Description

The Fast Fourier Transform (FFT) is one of the most widely used algorithms in engineering and scientific applications
and therefore its analysis and performance is of much importance to a range of research fields. On PRACE Tier-0 systems a
parallel environment with a great deal of processing power (large number of CPU cores or accelerators such as GPUs) is at
disposal for researchers. The FFTs investigated are both in-code and through various numerical libraries, where the algorithm is
implemented in both serial and parallel form. The implementations of FFT investigated range from pure MPI, OpenMP versions
for multicore, hybrid (OpenMP/MPI) to GPU-based.
The objective of this project is to assess the suitability, performance and scalability of various implementations of FFT
for the PRACE large-scale scientific applications Quantum ESPRESSO and DL_POLY.

Files

An Analysis of FFT Performance in PRACE Application Codes.pdf

Files (2.4 MB)

Additional details

Funding

European Commission
PRACE-1IP - PRACE - First Implementation Phase Project 261557