Node Level Performance Optimization
Description
This course covers code optimization for x86 platforms (Intel and AMD CPUs) and efficient node-level parallelization using OpenMP threading. It also covers advanced aspects of threading and optimization, including selected new features of the OpenMP 5.0 standard.
Learning outcome
- Awareness of modern features in x86 CPUs;
- Ability to vectorize computations;
- Ability to use advanced features of OpenMP;
- Ability to increase code performance using threading and x86 optimization.
Prerequisites and content level
- Good knowledge of C/C++ or Fortran;
- Good knowledge of threading using OpenMP;
- Basic knowledge of modern CPU architectures.
The content level of the course breaks down as follows: beginner 0%, intermediate 20%, advanced 80%, community-targeted content 0%.
Files
LECTURE slides_PTC Node level performance optimization @ CSC, 18.-20.5.2021.pdf
Size: 28.4 MB
md5:bee1b374f0f0a950c756e0da55a37940