Published May 26, 2022 | Version v1
Project deliverable Open

D3.3 – Initial report on the performance characteristics on relevant hardware for upcoming supercomputers

  • 1. KTH Royal Institute of Technology
  • 1. KTH Royal Institute of Technology
  • 2. Universit´e de Versailles Saint-Quentin-en-yvelines (UVSQ)
  • 3. CNRS

Description

This deliverable documents the initial performance analysis results obtained for all 6 TREX flagship applications. The focus was, in particular, the scaling of the application and the ability to exploit parallelism at all the different levels of modern HPC architectures. This ranges from the efficient use of SIMD instructions to the use of highly parallel compute accelerators like Graphics Processing Units (GPUs).

For assessing the applications in terms of scalability, it needs to be taken into account that they differ in terms of their principle ability to be highly parallelized. Some of the applications, e.g. QMC=Chem, implement scalable methods with a strong focus on scalability, which could be demonstrated using up to 32,768 CPU cores. Furthermore, very encouraging results have been obtained for TurboRVB from GPU acceleration using Europe’s currently fastest supercomputer, i.e. JUWELS Booster.

The performance results collected for this deliverable will help to guide further work and optimisations during the second half of the project.

Files

TREX-D3.3 - Initial report on the performance.pdf

Files (2.3 MB)

Additional details

Funding

TREX – Targeting Real chemical accuracy at the EXascale 952165
European Commission