Performance Optimisation on AMD EPYC

Mark Bull; Adrian Jackson

doi:10.5281/zenodo.7566953

Published January 25, 2023 | Version v1

Lesson Open

Performance Optimisation on AMD EPYC

1. EPCC

This course covers techniques for improving the performance of parallel applications by optimising of the code that runs within each node.

Modern HPC systems such as ARCHER2 are being constructed using increasingly powerful nodes, with larger and larger numbers of cores and enhanced vector capabilities. To extract maximum performance from applications, it is therefore necessary to understand, and be able to overcome, on-node performance bottlenecks. This course will cover the main features of modern HPC nodes, including multiple cores, vector floating point units, deep cache hierarchies, and NUMA memory systems. We will cover techniques for efficient programming of these features, using batch processing options and compiler options as well as hand tuning of code. The course will also contain an introduction to the use of Cray performance analysis tools.

Prerequisites:

Participants must have attended ARCHER2 for Software Developers or be familiar with software development on ARCHER, ARCHER2, or any other HPC facility, using C, C++ or Fortran.

This course is targeted at users interested in optimising the performance of their own applications, e.g. through compiler options or code changes.

Users interested in efficient use of centrally installed packages should consider attending Understanding Package Performance instead

Notes

https://www.archer2.ac.uk/training/courses/221213-performance-optimisation

Files

L00-overview.pdf

Files (8.2 MB)

Name	Size	Download all
L00-overview.pdf md5:701fc454688ce16d34e39027609081df	1.3 MB	Preview Download
L01-architecture.pdf md5:89045d8068359e29769de1eea9c84a47	1.0 MB	Preview Download
L02-profiling.pdf md5:1657aa642c8fa57a2f29fdce114c2714	1.0 MB	Preview Download
L03-optwithcomp.pdf md5:c599eeafc33edb8cecd6b2395d23eb5b	632.8 kB	Preview Download
L04-openmpopt.pdf md5:28da4182d52036f1eb94d4d95052df62	609.8 kB	Preview Download
L05-vectorisation.pdf md5:e110b39455d28e8cf766c9f886687f18	660.4 kB	Preview Download
L06-memoryopt.pdf md5:e38cc9a8313d1db32a8eae5ee1cc4455	660.2 kB	Preview Download
single-node-optimisation-course-2022-12-13.zip md5:1f21a34d8ae7dce6fb31244449a8c291	2.3 MB	Preview Download

	All versions	This version
Views	34	34
Downloads	43	43
Data volume	65.0 MB	65.0 MB

Performance Optimisation on AMD EPYC

Creators

Description

Notes

Files

L00-overview.pdf

Files (8.2 MB)