Published May 18, 2021 | Version v1
Lesson Open

Node Level Performance Optimization

  • 1. CSC
  • 2. Intel
  • 3. AMD

Description

This course covers code optimization for x86 platforms (Intel and AMD CPUs) and efficient node-level parallelization using OpenMP threading. Advanced aspects of threading and optimization, including selected new features of the OpenMP 5.0 standard, are also covered.

Learning outcomes

- Awareness of modern features in x86 CPUs;
- Ability to vectorize computations;
- Ability to use advanced features of OpenMP;
- Ability to increase code performance using threading and x86 optimization.
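
As a rough illustration of the kind of code these outcomes refer to, the sketch below combines OpenMP threading with vectorization in a single loop. It is not taken from the course materials; the function name dot_product is a hypothetical helper chosen for this example, and the code assumes a compiler with OpenMP 4.0 or later support enabled (for example -fopenmp with GCC or Clang). Without that flag the pragma is ignored and the loop runs serially with the same result.

```c
#include <stddef.h>

/* Hypothetical helper for illustration only, not from the course slides. */
double dot_product(const double *a, const double *b, size_t n)
{
    double sum = 0.0;

    /* Distribute iterations across threads and ask the compiler to
       vectorize each thread's chunk. The reduction clause gives every
       thread and SIMD lane a private partial sum, combined at the end. */
    #pragma omp parallel for simd reduction(+:sum)
    for (size_t i = 0; i < n; ++i) {
        sum += a[i] * b[i];
    }
    return sum;
}
```

Calling dot_product(a, b, n) on two arrays of length n returns their inner product. Because floating-point addition is not associative, the threaded result may differ from a serial sum in the last bits for general inputs.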

Prerequisites and content level

- Good knowledge of C/C++ or Fortran;
- Good knowledge of threading using OpenMP;
- Basic knowledge of modern CPU architectures.

The content level of the course breaks down as follows: beginner - 0%, intermediate - 20%, advanced - 80%, community-targeted content - 0%.

Files

LECTURE slides_PTC Node level performance optimization @ CSC, 18.-20.5.2021.pdf